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VIRULENCE-ASSOCIATED ADHESINS 

1 documents cited herein are incorporated by reference in their entirety. 



TECHNICAL FIELD 

This invention is in the field of bacterial adhesion. In particular, it relates to virulence-related 
5 adhesion antigens derived from Haemophilus influenzae, Escherichia coli and other organisms. 

BACKGROUND ART 

The Gram negative Haemophilus genus includes H. influenzae, Haegyptius (also referred to as 
HAnjluenzae biogroup aegyptius), Hdecreyi and Hsomnus. These bacteria can cause diseases 
including conjunctivitis, chancroid, purpuric fever, meningitis, pneumonia and epiglottitis. 
10 HAnjluenzae is the most commonly-found pathogen in this genus, and includes both typeable 
(encapsulated) and non-typeable (non-capsulated; 'NTHi') strains. 

A vaccine against Hinfluenzae type B ('Hib') based on a conjugate of its capsular saccharide and a 
carrier protein has been enormously successful, but there has been little progress in providing 
protection against other members of the species. In particular, type D Hinfluenzae and non-typeable 
15 H influenzae remain problematic. 

Similarly, vaccines remain unavailable for other bacterial pathogens such as enterotoxigenic (ETEC), 
enteropathogenic (EPEC), enteroaggregative (EAEC), enterohemorrhagic (EHEC) and shiga-toxic 
(STEC) strains of Escherichia coli. 

It is an object of the invention to provide materials and methods to improve the prevention and 
20 treatment of infections caused by such bacteria. More particularly, it is an object of the invention to 
provide materials suitable for immunising against bacterial infections. 

DISCLOSURE OF THE INVENTION 

Virulence-associated antigens involved in adhesion have been identified in several bacteria, and 
these antigens are useful for the diagnosis, prevention and treatment of bacteria! infections 

25 (particularly those caused by virulent strains). In particular, antigens have been identified in: 
Haemophilus influenzae biogroup aegyptius (SEQ ID NO: 1); Escherichia coli Kl (SEQ ED NO s : 2 
& 3) and also in EHEC strain EDL933; Actinobacillus actinomycetemcomitans (SEQ ID NO: 4); 
Haemophilus somnus (SEQ ID NO: 5); Haemophilus ducreyi (SEQ ID NO: 6); EPEC E.coli strain 
E2348/69 (SEQ ID NO s : 7 & 29); EAEC E.coli strain 042 (SEQ ID NO s : 8 & 9); uropathogenic 

30 E.coli (SEQ ID NO: 10); Shigella flexneri (SEQ ED NO: 1 1); Brucella melitensis (SEQ ID NO: 12); 
Brucella suis (SEQ ID NO: 13); Ralstonia solanacearum (SEQ ID NO: 14); Sinorhizobium meliloti 
(SEQ ED NO: 15); Bradorhizobium japonicum (SEQ ID NO: 16); and Burkholderia fimgorum (SEQ 
ID NO: 17). 



^Jthough the degree of sequence identity between the antigens of the invention is low, an 
appreciation of the antigens at a level beyond simple primary sequence information shows that they 
share a common arrangement of domains from N-terminus to C-terminus, namely: 

• a leader peptide 
5 o a globular head 

• a coiled-coil region 

• a transmembrane anchor region 

Sequence similarity between the various antigens is largely restricted to the C-terminal anchor 
region. This arrangement of domains is shared with N. meningitidis protein NadA { 1 } . 

10 The positions of these features in SEQ ID NO s : 1 to 1 8 are as follows: 



SEQID 


Organism | 


Length ) 


Leader 


Head 


Coiled-coil 


Anchor | 


1 


H.aegyptius | 


>223 


1-26 


27-55 


56-184 


185... 


2 


FHFC V 


338 


1-23 


24-207 


208-266 1 


267-338 


3 


J— JL JL1— <V> 1 


1588 


1-53 


54-1515* | 


1516-1588 


4 


A.actinomycetemcomitans \ 


295 


1-25 


26-150 


151-222 


223-295 


5 


H.somnus \ 


452 


1-26 


27-158 


. 159-378 1 


379-452 


6 1 


H. ducreyi 


273 1 


1-21 


22-198 * | 


199-273 


7 


EPEC 


338 


1-24 


25-209 


210-266 1 


267-338 


8 


EAEC 


717 


1-23 


24-109 


110-645 1 


646-717 


9 


1743 


1-53 


54-1670* 


1671-1743 


10 


UPEC 


1778 


i 1-53 


54-1705 * 


1706-1778 


11 


1 S.flexneri 


990 


1-917 * 


918-990 


12 


1 B.melitensis 


227 


1 1-27 


28-122 


123-154 


155-227 


13 


1 B.suis 


311 


1 1-27 


28-206 


207-238 


239-311 


14 


1 Rsolcmacearum 


1309 


| 1-230 * 


231-708 


1239-1309 


15 


S.meliloti 


1291 


| 1-1219* 


1220-1291 


16 


1 B.japonicum 


372 


1 1-72 


73-300 * 


301-372 


17 


B.fungornm 


3399 


1 1-57 


58-3328 * 


1 3329-3399 


18 


1 EPEC 


577 


| 1-504' 


505-577 1 



* The boundary between domains is less distinct for some polypeptides of the invention 



Antigens 

The invention provides a polypeptide comprising one or more of the following amino acid 
sequences: SEQ ID NO s : 1 to 18. 

15 The invention also provides a polypeptide comprising an amino acid sequence: (a) having at least 
m% identity to one or more of SEQ ED NO s : 1-18, where m is 50 or more {e.g. 60, 65, 70, 75, 80, 85, 
90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 99.5 or more); and/or (b) which is a fragment of at least n 
consecutive amino acids of one or more of SEQ ID NO s : 1-18, wherein n is 7 or more (e.g. 8, 10, 12, 



^p, 16, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250 or more). These polypeptides 
include variants (e.g. allelic variants, homologs, orthologs, paralogs, mutants, etc.) of SEQ ID NO s : 
1-18. 

Preferred fragments of (b) comprise an epitope from one or more of SEQ ID NO s : 1-18, preferably a 
5 B-cell epitope. B-celi epitopes can be identified empirically or can be predicted algorithmically. 

Other preferred fragments of (b) lack one or more amino acids (e.g. 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 
25 or more) from the C-terminus and/or one or more amino acids (e.g. 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 
20, 25, 45 or more) from the N-terminus of the relevant amino acid sequence from SEQ ID 
NO s : 1-18. In particular, preferred fragments omit at least the N-terminus leader sequence. 

10 Other preferred fragments omit one or more (i.e. 1, 2, or 3) of the four domains of SEQ ID 
NO 5 : 1-18, based on the above table. Other preferred fragments consist of one or more (i.e. 1, 2, or 3) 
of the four domains of SEQ ID NO s : 1-18. 

Preferred polypeptides of the invention are presented in oligomeric form (e.g. dimers, trimers, 
tetramers, etc.). Trimers are preferred, but monomeric polypeptides of the invention are also useful. 

1 5 The invention also provides polypeptides of the formula NH 2 -A-{-X-L-} jr B-COOH, wherein: 

- X comprises an amino acid sequence: (a) having at least m% identity to one or more of SEQ 
ID NO s : 1-18; and/or (b) which is a fragment of at least n consecutive amino acids of one or 
more of SEQ ID NO s : 1-18, as defined above; 

L is an optional linker amino acid sequence; 
20 A is an optional N-terminal amino acid sequence; 

B is an optional C-terminal amino acid sequence; and 

- x is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17 or 18 (preferably x=2). 

Where a -X- moiety has a leader peptide, this may be included or omitted in the hybrid protein. In 
some embodiments, the leader peptides will be deleted except for that of the -X- moiety located at 

25 the N-terminus of the hybrid protein i.e. the leader peptide of Xi will be retained, but thr? r 2ade:; 
peptides of X2 ... X x will be omitted. This is equivalent to deleting all leader peptides and using the 
leader peptide of Xi as moiety -A-. 

For each x instances of {-X-L-}, -X- may be the same or different, and linker amino acid sequence 
-L- may be present or absent. For instance, when x=2 the hybrid may be ISH2-X1-L1-X2-L2-COOH, 
30 NHz-XrXz-COOHi NH 2 -X r L r X 2 -COOH, NH 2 OC r X 2 -I^-COOH, etc. Linker amino acid 
sequence(s) -L- will typically be short (e.g. 20 or fewer amino acids i.e. 19, 18, 17, 16, 15, 14, 13, 12, 
11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1). Examples comprise short peptide sequences which facilitate cloning, 
poly-glycine linkers (i.e. comprising Gly„ where n = 2, 3, 4, 5, 6, 7, 8, 9, 10 or more), and histidine 
tags (i.e. His,, where n = 3, 4, 5, 6, 7, 8, 9, 10 or more). Other suitable linker amino acid sequences 



• 



011 be apparent to those skilled in the art. A useful linker is GSGGGG (SEQ ED NO: 19), with the 
Gly-Ser dipeptide being formed from a BamHL restriction site, thus aiding cloning and manipulation, 
and the (Gly) 4 tetrapeptide being a typical poly-glycine linker. 

-A- is an optional N-terminal amino acid sequence. This will typically be short (e.g. 40 or fewer 
5 amino acids Le. 39, 38, 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 19, 18, 
17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1). Examples include leader sequences to direct 
protein trafficking, or short peptide sequences which facilitate cloning or purification (e.g. histidine 
tags le. Hisft where h = 3, 4, 5 5 6, 7, 8, 9, 10 or more). Other suitable N-terminal amino acid 
sequences will be apparent to those skilled in the art. If Xi lacks its own N-terminus methionine, -A- 
10 is preferably an oligopeptide (e.g. with 1, 2, 3, 4, 5, 6, 7 or 8 amino acids) which provides a 
N-terminus methionine. 

-B- is an optional C-terminal amino acid sequence. This will typically be short (e.g. 40 or fewer 
amino acids i.e. 39, 38, 37, 36, 35, 34, 33, 32, 31, 30, 29, 28, 27, 26, 25, 24, 23, 22, 21, 20, 19, 18, 
17, 16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1). Examples include sequences to direct protein 
15 trafficking, short peptide sequences which facilitate cloning or purification (e.g. comprising histidine 
tags le. His/, where h = 3, 4, 5, 6, 7, 8, 9, 10 or more), or sequences which enhance protein stability. 
Other suitable C-terminal amino acid sequences will be apparent to those skilled in the art. 

The invention also provides polypeptides comprising the amino acid sequence: 



20 




25 



A is an optional sequence as defined above (preferably at the N-terminus of the polypeptide); 
B is an optional sequence as defined above (preferably at the C-terminus of the polypeptide); 
Wi is an optional amino acid sequence: (a) having at least m% identity to the leader peptide 
of one or more of SEQ ID NO s : 1-18; and/or (b) which is a fragment of at least n consecutive 
amino acids of the leader peptide of one or more of SEQ ID NO s : 1-18; 



W 2 is an optional amino acid sequence: (a) having at least m% identity to the globular head 
domain of one or more of SEQ ID NO s : 1-18; and/or (b) which is a fragment of at least n 
consecutive amino acids of the globular head domain of one or more of SEQ ID NO : 1-18; 



30 



W3 is an optional amino acid sequence: (a) having at least m% identity to the coiled-coil 
domain of one or more of SEQ ID NO s : 1-18; and/or (b) which is a fragment of at least n 
consecutive amino acids of the coiled-coil domain of one or more of SEQ ID NO : 1-18; 



W4 is an optional amino acid sequence: (a) having at least m% identity to the transmembrane 



1 



35 



anchor region of one or more of SEQ ID NO s : 1-18; and/or (b) which is a fragment of at least 
n consecutive amino acids of the transmembrane anchor region of one or more of SEQ ID 
NO s : 1-18; 



provided that at least one of Wi, W2, W3 or W4 is present. 
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invention also provides a polypeptide comprising a polypeptide as described above, wherein the 
amino acid sequence of the polypeptide contains one or more amino acid mutations. The mutation(s) 
preferably result in the reduction or removal of an activity of a polypeptide of the invention which is 
responsible directly or indirectly for virulence or adhesion. For example, the mutation may inhibit an 
5 enzymatic activity or may remove a binding site in the protein. Mutation may involve deletion 
substitution, and/or insertion, any of which may be involve one or more amino acids. As an 
alternative, the mutation may involve truncation. 

Mutagenesis of virulence factors is a well-established science for many bacteria {e.g. toxin 
mutagenesis described in refs. 2 to 8}. Mutagenesis may be specifically targeted to nucleic acid 
10 encoding a polypeptide of the invention. Alternatively, mutagenesis may be global or random (e.g. 
by irradiation, chemical mutajgenesis, etc.), which will typically be followed by screening bacteria for 
those in which a mutation has been introduced into a gene encoding a polypeptide of the invention. 
Such screening may be by hybridisation assays (e.g. Southern or Northern blots etc.), primer-based 
amplification (e.g. PCR), sequencing, proteomics, aberrant SDS-PAGE gel migration, etc. 

15 Polypeptides of the invention can be prepared by various means (e.g. recombinant expression, 
purification from cell culture, chemical synthesis, etc.) and in various forms (e.g. native, fusions, 
non-glycosylated, lipidated, etc.). They are preferably prepared in substantially pure form (i.e. 
substantially free from other bacterial or host cell proteins). 

Whilst expression of the polypeptides of the invention may take place in the native host, the 
20 invention preferably utilises a heterologous host. The heterologous host may be prokaryotic (e.g. a 
bacterium) or eukaryotic. It is preferably E.coli, but other suitable hosts include Bacillus subtilis, 
Vibrio cholerae, Salmonella typhi, Salmonella typhimurium, Neisseria lactamica, Neisseria cinerea, 
Mycobacteria (e.g. M.tuberculosis), yeasts, etc. 

Antibodies 

25 The invention also provides antibodies which bind to polypeptides of the invention. 

Antibody of the invention preferably has an affinity for a polypeptide of the invention of a!; least 
10' 7 M e.g. 10~ 8 M, 10" 9 M, 10' 10 M or tighter. Preferred antibodies can block the ability of a 
polypeptide of the invention to bind to a human cell. 

Antibodies of the invention may be polyclonal or monoclonal and may be produced by any suitable 
30 means (e.g. by recombinant expression, purification from cell culture, chemical synthesis, etc.) and 
in various forms (e.g. native, fusions, glycosylated, non-glycosylated, etc.). They are preferably 
prepared in substantially pure form (i.e. substantially free from other antibodies). 

The term "antibody" includes whole antibodies, Fv, scFv, Fc, Fab, F(ab f )2, etc. 



^rtibodies of the invention may include a label. The label may be detectable directly, such as a 
radioactive or fluorescent label. Alternatively, the label may be detectable indirectly, such as an 
enzyme whose products are detectable (e.g. luciferase, p-galactosidase, peroxidase, etc.). 

Antibodies of the invention may be attached to a solid support. 

5 Antibodies of the invention may be prepared by administering (e.g. injecting) a polypeptide of the 
invention to an appropriate animal (e.g. a rabbit, hamster, mouse or other rodent). 

To increase compatibility with the human immune system, the antibodies may be chimeric or 
humanized {e.g. refs. 9 & 10}, or fully human antibodies may be used. Because humanized 
antibodies are far less immunogenic in humans than the original non-human monoclonal antibodies, 
10 they can be used for the treatment of humans with far less risk of anaphylaxis. Thus, these antibodies 
may be preferred in therapeutic applications that involve in vivo administration to a human such as, 
use as radiation sensitizers for the treatment of neoplastic disease or use in methods to reduce the 
side effects of cancer therapy. 

Humanized antibodies may be achieved by a variety of methods including, for example: (1) grafting 
15 non-human complementarity determining regions (CDRs) onto a human framework and constant 
region ("humanizing"), with the optional transfer of one or more framework residues from the 
non-human antibody; (2) transplanting entire non-human variable domains, but "cloaking" them with 
a human-like surface by replacement of surface residues ("veneering"). In the present invention, 
humanized antibodies will include both "humanized" and "veneered" antibodies. {11, 12, 13, 14, 15, 
20 16, 17}. Humanized or fully-human antibodies can also be produced using transgenic animals that 
are engineered to contain human immunoglobulin loci. 

The phrase "constant region" refers to the portion of the antibody molecule that confers effector 
functions. In chimeric antibodies, mouse constant regions are substituted by human constant regions. 
The constant regions of humanized antibodies are derived from human immunoglobulins. The heavy 
25 chain constant region can be selected from any of the 5 isotypes: alpha, delta, epsilon, gamma or mu. 

Nzcdeic acids 

The invention also provides nucleic acid encoding the polypeptides of the invention. Furthermore, 
the invention provides nucleic acid which can hybridise to this nucleic acid, preferably under "high 
stringency" conditions (e.g. 65°C in a O.lxSSC, 0.5% SDS solution). 

30 Nucleic acid according to the invention can be prepared in many ways (e.g. by chemical synthesis, 
from genomic or cDNA libraries, from the organism itself, etc.) and can take various forms (e.g. 
single stranded, double stranded, vectors, probes, etc.). They are preferably prepared in substantially 
pure form (i.e. substantially free from other bacterial or host cell nucleic acids). 




6- 




5 



10 



15 



20 



25 
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03 term "nucleic acid" includes DNA and RNA, and also their analogues, such as those containing 
modified backbones (e.g. phosphorothioates. etc.), and also peptide nucleic acids (PNA), etc. The 
invention includes nucleic acid comprising sequences complementary to those described above (e.g. 
for antisense or probing purposes). 

Immunogenic compositions and medicaments 

Based on the structural and functional similarities to NadA, which is a good anti-meningococcal 
immunogen {1}, including their association with virulence, the polypeptides of the invention should 
also be useful for immunisation purposes. 

The invention provides a composition comprising a polypeptide and/or a nucleic acid and/or an 
antibody of the invention. Compositions of the invention are preferably immunogenic compositions, 
and are more preferably vaccine compositions. Vaccines according to the invention may either be 
prophylactic (i.e. to prevent infection) or therapeutic (i.e. to treat infection), but will typically be 
prophylactic. 

The pH of the composition is preferably between 6 and 8, preferably about 7. The pH may be 
maintained by the use of a buffer. The composition may be sterile and/or pyrogen-free. The 
composition may be isotonic with respect to humans. 

The invention also provides a composition of the invention for use as a medicament The 
medicament is preferably able to raise an immune response in a mammal (i.e. it is an immunogenic 
composition) and is more preferably a vaccine. 

The invention also provides the use of one or more (e.g. 2, 3, 4, 5, 6) of the polypeptides of the 
invention in the manufacture of a medicament for raising an immune response in a mammal. The 
medicament is preferably a vaccine. 

The invention also provides a method for raising an immune response in a mammal comprising the 
step of administering an effective amount of a composition of the invention. The immune response is 
preferably protective and preferably involves antibodies and/or cell-mediated immunity. The method 
may raise a booster response. 

The mammal is preferably a human. Where the vaccine is for prophylactic use, the human is 
preferably a child (e.g. a toddler or infant) or a teenager; where the vaccine is for therapeutic use, the 
human is preferably a teenager or an adult. A vaccine intended for children may also be administered 
to adults e.g. to assess safety, dosage, immunogenicity, etc. 

These uses and methods are preferably for the prevention and/or treatment of a disease caused by 
Haemophilus influenzae biogroup aegyptius, Escherichia coli (particularly EHEC, EAEC, ETEC, 
EPEC and UPEC strains), Actinobacillus actinomycetemcomitans, Haemophilus somnus', 
Haemophilus ducreyi, Shigella flexneri, Brucella melitensis, Brucella suis, Ralstonia solanacearum, 
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^^orhizobium meliloti, Bradorhizobium japonicum and Bwkholderia fungorum. Thus the invention 
is suitable for the prevention and/or treatment of diseases including: conjunctivitis, chancroid, 
purpuric fever, meningitis, pneumonia, epiglottitis, peri-implantitis, periodontal disease, gingivitis, 
bovine encephalitis^ arthritis, myocarditis, diarrhoea, ovine abortion, orchitis, undulant fever, porcine 
5 reproductive wastage, brucellosis, etc. 

One way of checking efficacy of therapeutic treatment involves monitoring bacterial infection after 
administration of the composition of the invention. One way of checking efficacy of prophylactic 
treatment involves monitoring immune responses against the polypeptides after administration of the 
composition. 

10 Compositions of the invention will generally be administered directly to a patient. Direct delivery 
may be accomplished by parenteral injection (e.g. subcutaneously, intraperitoneally, intravenously, 
intramuscularly, or to the interstitial space of a tissue), or by rectal, oral {e.g. tablet, spray), vaginal, 
topical, transdermal {e.g. see ref. 18} or transcutaneous {e.g. see refs. 19 & 20}, intranasal {e.g. see 
ref. 21}, ocular, aural, pulmonary or other mucosal administration. 

1 5 The invention may be used to elicit systemic and/or mucosal immunity. 

Dosage treatment can be a single dose schedule or a multiple dose schedule. Multiple doses may be 
used in a primary immunisation schedule and/or in a booster immunisation schedule. In a multiple 
dose schedule the various doses may be given by the same or different routes e.g. a parenteral prime 
and mucosal boost, a mucosal prime and parenteral boost, etc. 

20 Bacterial infections affect various areas of the body and so the compositions of the invention may be 
prepared in various forms. For example, the compositions may be prepared as injectables, either as 
liquid solutions or suspensions. Solid forms suitable for solution in, or suspension in, liquid vehicles 
prior to injection can also be prepared (e.g. a lyophilised composition). The composition may be 
prepared for topical administration e.g. as an ointment, cream or powder. The composition may be 

25 prepared for oral administration e.g. as a tablet or capsule, as a spray, or as a syrup (optionally 
flavoured). The composition may be prepared for pulmonary administration e.g. as an inhaler, using 
a fine powder or a spray. The composition may be prepared as a suppository or pessary. The 
composition may be prepared for nasal, aural or ocular administration e.g. as drops. The composition 
may be in kit form, designed such that a combined composition is reconstituted just prior to 

30 administration to a patient. Such kits may comprise one or more antigens in liquid form and one or 
more lyophilised antigens. 

Immunogenic compositions used as vaccines comprise an immunologically effective amount of 
antigen(s), as well as any other components, as needed. By 'immunologically effective amount', it is 
meant that the administration of that amount to an individual, either in a single dose or as part of a 
35 series, is effective for treatment or prevention. This amount varies depending upon the health and 
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^,'sical condition of the individual to be treated, age, the taxonomic group of individual to be treated 
(e.g. non-human primate, primate, etc.), the capacity of the individual's immune system to synthesise 
antibodies, the degree of protection desired, the formulation of the vaccine, the treating doctor's 
assessment of the medical situation, and other relevant factors. It is expected that the amount will fall 
5 in a relatively broad range that can be determined through routine trials. 

The invention also provides the polypeptides of the invention (including NadA itself) for use as 
adjuvants (parenteral and/or mucosal). Similarly, the invention provides a composition comprising a 
polypeptide of the invention in admixture with a second antigen, whereby the polypeptide of the 
invention enhances the immune response against the second antigen when administered to a patient. 

10 Further components of the composition 

The composition of the invention will typically, in addition to the components mentioned above, 
comprise one or more 'pharmaceutically acceptable carriers', which include any carrier that does not 
itself induce the production of antibodies harmful to the individual receiving the composition. 
Suitable carriers are typically large, slowly metabolised macromolecules such as proteins, 

15 polysaccharides, polylactic acids, polyglycolic acids, polymeric amino acids, amino acid copolymers, 
and lipid aggregates (such as oil droplets or liposomes). Such carriers are well known to those of 
ordinary skill in the art. The vaccines may also contain diluents, such as water, saline, glycerol, etc. 
Additionally, auxiliary substances, such as wetting or emulsifying agents, pH buffering substances, 
and the like, may be present. A thorough discussion of pharmaceutically acceptable excipients is 

20 available in reference 22. 

Vaccines of the invention may be administered in conjunction with other immunoregulatory agents. 
In particular, compositions will usually include an adjuvant. Preferred further adjuvants include, but 
are not limited to: (A) aluminium salts, including hydroxides {e.g. oxyhydroxides), phosphates {e.g. 
hydroxyphoshpates, orthophosphates), sulphates, etc. {e.g. see chapters 8 & 9 of ref. 23}), or 
25 mixtures of different aluminium compounds, with the compounds taking any suitable form {e.g. gel, 
crystalline, amorphous, etc.), and with adsorption being preferred; (B) MF59 (5% Squalene, 0.5% 
Tween 80, and 0.5% Span 85, formulated into submicron particles using a microfluicizer) {see 
Chapter 10 of 23; see also ref. 24}; (C) liposomes {see Chapters 13 and 14 of ref. 23}; (D) ISCOMs 
{see Chapter 23 of ref. 23}, which may be devoid of additional detergent {25}; (E) SAF, containing 
30 10% Squalane, 0.4% Tween 80, 5% pluronic-block polymer L121, and thr-MDP, either 
microfluidized into a submicron emulsion or vortexed to generate a larger particle size emulsion {see 
Chapter 12 of ref. 23}; (F) Ribi™ adjuvant system (RAS), (Ribi Immunochem) containing 2% 
Squalene, 0.2% Tween 80, and one or more bacterial cell wall components from the group consisting 
of monophosphorylipid A (MPL), trehalose dimycolate (TDM), and cell wall skeleton (CWS), 
preferably MPL + CWS (Detox™); (G) saponin adjuvants, such as QuilA or QS21 {see Chapter 22 
of ref. 23}, also known as Stimulon™ {26}; (H) chitosan {e.g. 27}; (I) complete Freund's adjuvant 
(CFA) and incomplete Freund's adjuvant (IFA); (J) cytokines, such as interleukins {e.g. IL-1, IL-2, 



35 
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A-4, IL-5, JL-6, IL-7, EL-12, etc.), interferons (e.g. interferon-^), macrophage colony stimulating 
^tor, tumor necrosis factor, etc. {see Chapters 27 & 28 of ref. 23}; (K) monophosphoryl lipid A 
(MPL) or 3-O-deacylated MPL (3dMPL) {e.g. chapter 21 of ref. 23); (L) combinations of 3dMPL 
with, for example, QS21 and/or oil-in-water emulsions {28}; (M) a polyoxyethylene ether or a 

5 polyoxyethylene ester {29}; (N) a polyoxyethylene sorbitan ester surfactant in combination with an 
octoxynol {30} or a polyoxyethylene alkyl ether or ester surfactant in combination with at least one 
additional non-ionic surfactant such as an octoxynol {31}; (N) a particle of metal salt {32}; (O) a 
saponin and an oil-in-water emulsion {33}; (P) a saponin (e.g. QS21) + 3dMPL + IL-12 (optionally 
+ a sterol) {34}; (Q) Kcoli heat-labile enterotoxin ("LT"), or detoxified mutants thereof, such as the 
10 K63 or R72 mutants {e.g. Chapter 5 of ref. 35}; (R) cholera toxin ("CT"), or detoxified mutants 
thereof {e.g. Chapter 5 of ref. 35}; (S) double-stranded RNA; (T) microparticles (i.e. a particle of 
-lOOnm to ~150um in diameter, more preferably ~200nm to ~30um in diameter, and most preferably 
~500nm to ~10um in diameter) formed from materials that are biodegradable and non-toxic (e.g. a 
poly(a-hydroxy acid), a polyhydroxybutyric acid, a polyorthoester, a polyanhydride, a 
15 polycaprolactone, etc.), with poly(lactide-co-glycolide) being preferred, optionally treated to have a 
negatively-charged surface (e.g. with SDS) or a positively-charged surface (e.g. with a cationic 
detergent, such as CTAB); (U) oligonucleotides comprising CpG motifs i.e. containing at least one 
CG dinucleotide, with 5-methylcytosine optionally being used in place of cytosine; (V) 
monophosphoryl lipid A mimics, such as aminoalkyl glucosaminide phosphate derivatives e.g. 
20 RC-529 {36}; (W) polyphosphazene (PCPP); (X) a bioadhesive {37} such as esterified hyaluronic 
acid microspheres {38} or a mucoadhesive selected from the group consisting of cross-linked 
derivatives of poly(acrylic acid), polyvinyl alcohol, polyvinyl pyrollidone, polysaccharides and 
carboxymethylcellulose; or (Y) other substances that act as immunostimulating agents to enhance the 
effectiveness of the composition {e.g. see Chapter 7 of ref. 23}. Aluminium salts and MF59 are 
25 preferred adjuvants for parenteral immunisation. Mutant toxins are preferred mucosal adjuvants. 

Muramyl peptides include N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl- 
normuramyl-L-alanyl-D-isoglutamine (nor-MDP), N-acerylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine- 
2-(l'-2'-dipalmitoyl-jn-glycero-3-hydroxyphosphoryloxy)-ethylamineMTP-PE), etc. 

The composition may include an antibiotic. 
30 Further antigens 

As well as containing polypeptides of the invention, the compositions of the invention may also 
include one or more further antigens. Further antigens for inclusion may be, for example: 

- a saccharide antigen from N. meningitidis serogroup A, C, W135 and/or Y, such as the 
oligosaccharide disclosed in ref. 39 from serogroup C {see also ref. 40} or the 

35 oligosaccharides of ref. 41. 

- antigens from Helicobacter pylori such as CagA {42 to 45}, VacA {46, 47}, NAP {48, 49, 
50}, HopX {e.g. 51}, HopY {e.g. 51} and/or urease. 

- a saccharide antigen from Streptococcus pneumoniae {e.g. 52, 53, 54}. 
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a protein antigen from Streptococcus pneumoniae {e.g. 55} . 

an antigen from hepatitis A virus, such as inactivated virus {e.g. 56, 57}. 

an antigen from hepatitis B virus, such as the surface and/or core antigens {e.g. 57, 58}. 

an antigen from hepatitis C virjs {e.g. 59}. 

a diphtheria antigen, such as a diphtheria toxoid {e.g. chapter 3 of ref. 60} e.g. the CRM197 
mutant {e.g. 61}. 

a tetanus antigen, such as a tetanus toxoid {e.g. chapter 4 of ref. 60}. 

an antigen from Bordetella pertussis, such as pertussis holotoxin (PT) and filamentous 
haemagglutinin (FHA) from B.pertussis, optionally also in combination with pertactin and/or 
agglutinogens 2 and 3 {e.g. refs. 62 & 63}; whole-cell pertussis antigen may also be used, 
a saccharide antigen from Haemophilus influenzae B {e.g. 40}. 
polio antigen(s) {e.g. 64, 65} such as OPV or, preferably, DPV. 

a protein antigen from N. meningitidis serogroup B {e.g. refs. 66 to 77}, such as NadA. 

an outer-membrane vesicle (OMV) preparation from N. meningitidis serogroup B, such as 

those disclosed in refs. 78, 79, 80, 81, etc. 

an antigen from Chlamydia pneumoniae {e.g. refs. 82 to 88}. 

an antigen from Chlamydia trachomatis {e.g. 89}. 

an antigen from Porphyromonas gingivalis {e.g. 90}. 

rabies antigen(s) {e.g. 91} such as lyophilised inactivated virus {e.g. 92, RabAvert™}. 

measles, mumps and/or rubella antigens {e.g. chapters 9, 10 & 1 1 of ref. 60}. 

influenza antigen(s) {e.g. chapter 19 of ref. 60}, such as the hemagglutinin and/or 

neuraminidase surface proteins. 

an antigen from gonorrhoeae {e.g. 93, 94, 95, 96}. 

antigen(s) from a paramyxovirus such as respiratory syncytial virus (RSV {97, 98}) and/or 
parainfluenza virus (PIV3 {99}). 

an antigen from Moraxella catarrhalis {e.g. 100}, such as UspAl and/or UspA2 

an antigen from Streptococcus pyogenes (group A streptococcus) {e.g. 101, 1 02, 103}. 

an antigen from Streptococcus agalactiae (group B streptococcus) {e.g. 104}. 

an antigen from Staphylococcus aureus {e.g. 105}. 

an antigen from Bacillus anthracis {e.g. 106, 107, 108}. 

an antigen from a virus in the flaviviridae family (genus flavivirus), such as from yellow 

fever virus, Japanese encephalitis virus, four serotypes of Dengue viruses, tick-borne 

encephalitis virus, West Nile virus. 

an antigen from Pseudomonas. 

an antigen from a HIV e.g. a HIV-1 or HIV-2. 

an antigen from a rotavirus. 
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a pestivirus antigen, such as from classical porcine fever virus, bovine viral diarrhoea virus, 
and/or border disease virus. 

- a parvovirus antigen e.g. from parvovirus B19. 

- a coronavirus antigen e.g. from the SARS coronoavirus. 

- a cancer antigen, such as those listed in Table 1 of ref. 109 or in tables 3 & 4 of ref. 1 10. 

The composition may comprise one or more of these further antigens. It is preferred that 
combinations of antigens should be based on shared characteristics e.g. antigens associated with 
respiratory diseases, antigens associated with enteric diseases, antigens associated with sexually- 
transmitted diseases, etc. 

Where a saccharide or carbohydrate antigen is used, it is preferably conjugated to a carrier protein in 
order to enhance immunogenicity {e.g. refs. Ill to 120}. Preferred carrier proteins are bacterial 
toxins or toxoids, such as diphtheria or tetanus toxoids. The CRM197 diphtheria toxoid is particularly 
preferred {121}. Other carrier polypeptides include the N. meningitidis outer membrane protein 
{122}, synthetic peptides {123, 124}, heat shock proteins {125, 126}, pertussis proteins {127, 128}, 
protein D from H. influenzae {129}, cytokines {130}, lymphokines {130}, hormones {130}, growth 
factors {130}, toxin A or B from C.difficile {131}, iron-uptake proteins {132}, etc. Where a mixture 
comprises capsular saccharides from both serogroups A and C, it may be preferred that the ratio 
(w/w) of MenA saccharide :MenC saccharide is greater than 1 (e.g. 2:1, 3:1, 4:1, 5:1, 10:1 or higher). 
Different saccharides can be conjugated to the same or different type of carrier protein. Any suitable 
conjugation reaction can be used, with any suitable linker where necessary. 

Toxic protein antigens may be detoxified where necessary e.g. detoxification of pertussis toxin by 
chemical and/or genetic means {63}. 

Where a diphtheria antigen is included in the composition it is preferred also to include tetanus 
antigen and pertussis antigens. Similarly, where a tetanus antigen is included it is preferred also to 
include diphtheria and pertussis antigens. Similarly, where a pertussis antigen is included it is 

preferred also to ir elude dipk A herir, ard tetanus ant^ens. 

Antigens in the composition will typically be present at a concentration of at least lp,g/ml each. In 
general, the concentration of any given antigen will be sufficient to elicit an immune response against 
that antigen. 

As an alternative to using protein antigens in the composition of the invention, nucleic acid encoding 
the antigen may be used {e.g. refs. 133 to 141}. Protein components of the compositions of the 
invention may thus be replaced by nucleic acid (preferably DNA e.g. in the form of a plasmid) that 
encodes the protein. 
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^^cesses 

The invention also provides a process for producing a polypeptide of the invention, comprising the 
step of culturing a host cell transformed with nucleic acid of the invention under conditions which 
induce polypeptide expression. 

The invention provides a process for producing a polypeptide of the invention^ comprising the step of 
synthesising at least part of the polypeptide by chemical means. 

The invention provides a process for producing nucleic acid of the invention, comprising the step of 
amplifying nucleic acid using a primer-based amplification method (e.g. PCR). 

The invention provides a process for producing nucleic acid of the invention, comprising the step of 
synthesising at least part of the nucleic acid by chemical means. 

The invention also provides a process for detecting the presence of a bacterium in a sample, 
comprising the step of contacting the sample with nucleic acid of the invention under hybridizing 
conditions; and (b) detecting the presence or absence of hybridization of nucleic acid of the invention 
to nucleic acid present in the sample. The presence of hybridization in step (b) indicates that the 
sample contains the relevant bacterium. 

The invention also provides an immunoassay method for detecting the presence of a bacteirum, 
comprising the step of contacting a sample with a polypeptide or antibody of the invention. 

Adhesion inhibition 

The invention provides methods for inhibiting the attachment of bacterial cells to host cells (e.g. 
human cells). The cell may be in vitro (e.g. in cell culture) or in vivo. The cells are most preferably 
human cells. The host cells will typically be epithelial or endothelial cells. 

The invention provides a method for preventing the attachment of a bacterial cell to a host cell, 
wherein the ability of one or more of the polypeptides of the invention to bind to the host cell is 

blocked. 

The ability to bind may be blocked in various ways but, most conveniently, an antibody specific for a 
polypeptide of the invention is used. As an alternative to using antibodies, antagonists of the 
interaction between the polypeptide of the invention and its receptor on the host cell may be used. As 
a further alternative, a soluble form of the host cell receptor may be used as a decoy. These can be 
produced by removing the receptor's transmembrane and, optionally, cytoplasmic regions. 

The antibodies, antagonists and soluble receptors of the invention may be used as medicaments to 
prevent the attachment of a bacterial cell to a host cell. 

The invention provides a method for preventing the attachment of a bacterial cell to a host cell, 
wherein expression of a polypeptide of the invention is inhibited. The inhibition may be at the level 
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^transcription and/or translation. A preferred technique for inhibiting expression of the gene is 
antisense {e.g. refs. 142 to 148, etc.}. Antibacterial antisense techniques are disclosed in, for 
example, references 149 & 150. 

The invention provides a method for preventing the attachment of a Neisserial cell to an epithelial 
5 ceil, wherein the gene encoding the polypeptide of the invention is knocked out. Thus the invention 
provides a bacterium in which such genes have been knocked out. Techniques for producing 
knockout bacteria are well known. The knockout mutation may be situated in the coding region of 
the gene or may lie within its transcriptional control regions {e.g. within its promoter). The knockout 
mutation will reduce the level of mRNA encoding a polypeptide of the invention to <1% of that 
1 0 produced by the wild-type bacterium e.g. <0.5%, <0.1 %, 0%. The knockout mutants of the invention 
may be used as immunogenic compositions (e.g. as vaccines). Such a vaccine may include the 
mutant as a live attenuated bacterium. 

The invention also provides methods for screening compounds to identify those (antagonists) which 
inhibit the binding of a bacterial cell to a host cell. 

1 5 Potential antagonists for screening include small organic molecules, peptides, peptoids, polypeptides, 
lipids, metals, nucleotides, nucleosides, polyamines, antibodies, and derivatives thereof. Small 
organic molecules have a molecular weight between 50 and about 2,500 daltons, and most preferably 
in the range 200-800 daltons. Complex mixtures of substances, such as extracts containing natural 
products, compound libraries or the products of mixed combinatorial syntheses also contain potential 

20 antagonists. 

Typically, a polypeptide of the invention is incubated with a host cell and a test compound (e.g. an 
antibody), and the mixture is then tested to see if the interaction between the protein and the 
epithelial cell has been inhibited. The protein, cell and compound may be mixed in any order. 

Inhibition will, of course, be determined relative to a standard (e.g. the native protein/cell 
25 interaction). Preferably, the standard is a control value measured in the absence of the test compound. 
It will be appreciated that the standard may have been determined before performing the method, or 
may be determined during or after the method has been performed. It may also be an absolute 
standard. 

For preferred high-throughput screening methods, all the biochemical steps for this assay are 
30 performed in a single solution in, for instance, a test tube or microtitre plate, and the test compounds 
are analysed initially at a single compound concentration. For the purposes of high throughput 
screening, the experimental conditions are adjusted to achieve a proportion of test compounds 
identified as "positive" compounds from amongst the total compounds screened. 
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method may also simply involve incubating one or more test compound(s) with a polypeptide of 
the invention and determining if they interact. Compounds that interact with the protein can then be 
tested for their ability to block an interaction between the protein and an epithelial cell. 

Other methods which may be used include, for example, reverse two hybrid screening {151} ?r. 
5 which the inhibition of the bacteria:host receptor interaction is reported as a failure to activate 
transcription. 

The invention also provides a compound identified using these methods. These can be used to treat 
or prevent bacterial infection. The compound preferably has an affinity for App, ORF40 and/or 
NadA of at least 10" 7 M e.g. 10" 8 M, 10" 9 M, 10" 10 M or tighter. 

1 0 Definitions 

The term "comprising" means "including" as well as "consisting" e.g. a composition "comprising" X 
may consist exclusively of X or may include something additional e.g. X + Y. 

The term "about" in relation to a numerical value x means, for example, x±lQ%. 

References to a percentage sequence identity between two amino acid sequences means that, when 
15 aligned, that percentage of amino acids are the same in comparing the two sequences. This alignment 
and the percent homology or sequence identity can be determined using software programs known in 
the art, for example those described in section 7.7.18 of reference 152. A preferred alignment is 
determined by the Smith-Waterman homology search algorithm using an affine gap search with a 
gap open penalty of 12 and a gap extension penalty of 2, BLOSUM matrix of 62. The Smith- 
20 Waterman homology search algorithm is disclosed in reference 153. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1 to 15 show analyses of amino acid sequences of the invention to show coiled-coil regions. 

Figure 16 shows conservation between anchor regions of polypeptides of the invention. 

Figure 17 is an illustration of the NadA structure within the meningococcal outer membrane, in 
25 monomeric and trimeric form. 

Figures 1 8 & 19 show comparisons of the genetic environment of genes encoding polypeptides of the 
invention. Figure 20 illustrates the genetic environment in E.coli Kl vs. K12. 

MODES FOR CARRYING OUT THE INVENTION 

Neisseria meningitidis NadA protein 
30 Within the Neisseria meningitidis serogroup B genome {75}, an outer membrane protein (NadA) was 
identified {1} which shows weak homology to Yersinia enterocolitica adhesin YadA and to 
Moraxella catarrhalis surface protein UspA2 {154}. The nadA gene is present in a subgroup of 
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^^ervirulent Kmeningitidis strains and is characterized by a low GC content, which suggests a 
probable acquisition event of the gene by horizontal transfer. 

To investigate the. possibility that proteins similar to the NadA adhesin could have been acquired by 
other pathogens, we searched for homologous proteins. 

5 A sequence alignment of NadA & YadA revealed that the two proteins are most similar at the 
C-terminus, which is the membrane anchor domain. In NadA, this domain is approximately 70 
residues long and contains five predicted amphipatic beta strands, which cross the outer membrane 
multiple times thus anchoring the protein to the surface of the bacterium (Figure 17). Within this 
region, the level of sequence similarity between NadA & YadA is around 60% identity while in the 

1 0 N-terminal and central domain the homology is below 25% identity. 

In a first search, based on the NadA anchor domain, results included YadA and UspA2, but also 
other proteins, such as the serum resistance protein DsrA of Haemophilus ducreyi, the 
immunoglobulin binding proteins EibA-C-D-E and F of E.coli, and the outer membrane protein 100 
of Aclinobacillus actinomycetemcomitans {154}. In order to highlight more distant members of this 

15 family, these results were used for further searches, and this approach identified 16 further results. 
These 16 polypeptides were further evaluated for secondary structure analysis, coiled coil prediction 
and presence/absence of a leader peptide. As expected, despite the little amino acid similarity 
displayed within the central regions, most of the identified polypeptides possess the coiled coil 
feature, which gives them the capability to form stable oligomers. The anchor regions of the 

20 identified polypeptides are well conserved (Figure 16). In addition, the GC content of the genes 
encoding these polypeptides was lower than average for their respective genomes, suggesting that 
they are encoded by genes carried on mobile genetic elements. 

Escherichia colt 

Polypeptides were found in pathogenic strains of E.coli, including enteropathogenic (EPEC), 
25 enteroaggregative (EAEC), enterohemorragic (EHEC) and uropathogenic (UPEC) strains. 

Furthermore, a polypeptide almost identical to those of the BIT1C and 3?BC straps was found in the 
Kl strain, which is a capsulated Rcoli strain responsible for neonatal meningitis. The Kl sequence 
aligns with NadA as follows: 

100 110 120 130 140 150 

kl . pep TGWQIPARYQSMINARQSAVTDAQQTQITEQQAQIVATQKTLAATGDTQNTAHYQEMIN 

I : : : I I : : I I : : I 1 : : : I : I 
NadA. pep DAALADTD AALDETTNALNKLGEN ITT FAEETKTN I VKI DEKLE AVADTVD — KHAEAFN 
130 140 150 160 170 180 

160 170 180 190 200 210 

kl . pep ARI^QNEMQRTTTEQGQKMNALTTDVAAQQQKERAQYDKQMQSLAQKSVQAHEQIESL 

: : | : | | : : : ^ I : : : | : : : : I : : I I : I : : 

NadA . pep DI ADS LDETN — TKADEAVKTANEAKQTAEETKQNVDAKVKAAETAAGKAEAAAGTANTA 

190 200 210 . 220 230 240 

220 230 240 250 260 270 
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kl . pep RQDSAQTQQQLTNTQBCRVADNSQQINTLNNHFDSLKNEVEDNRKEANAGTASAIAIASQP 
W : : I : : : I I : : I : : | I | : : I : I I I : II I : : 

NadA. pep .ADKAEAVAAKVTDIKADIATNKADIAKNSARID 

250 260 270 280 290 300 

280 290 300 310 320 330 

kl . pep QVKTGDVMMVS AGAGT FNGE S AVS VGTS FNAGTHT VLKAGI S ADTQS D FGAGVGVG YS F 

I : : | : 1 : : | : : : | | I I : : I I : I : : I I I : : : I : I : I : II:: 

NadA. pep QP YNVGRFNVTAAVGGYKSE SAVAIGTGFRFTENFAAKAGVAVGT S SGS SAAYHVGVN YEW 

310 320 330 340 350 360 

24.4% identity in 209 aa overlap 

Another NadA analogue was encoded by the large virulence plasmid present in shiga toxigenic 
strains of E.coli (STEC) {155}. This protein (Saa) is expressed on the outer membrane of E.coli and 
forms high molecular weight oligomers. In contrast, no counterpart of NadA could be detected in the 
benign E.coli strain K12, supporting the view that these genes have been acquired by lateral 
exchange early during evolution of the species (Figure 20). Nor could a counterpart be seen in 
laboratory stein MG1655. 

Prompted by these observations, and in order to assess a possible mechanism of insertion/deletion of 
these genes, the arrangement of the region that harbours the gene coding for the NadA-like molecule 
was investigated. The sequence of this region for the EHEC strain is SEQ ID NO: 23 

This analysis showed that the gene organisation of the DNA segments is almost identical among the 
genomes of Kl, EHEC and EPEC, with a sequence conservation of the NadA-like proteins that 
ranges from 95% identity between Kl and EHEC to 98% identity between Kl arid EPEC. In the case 
of EAEC, although the flanking regions are conserved, the sequence of the NadA-like protein is 380 
residues longer than the others, even if the N-terminus and C-terminus are well conserved. 



Bacterium 


Amino acid 


Nucleic acid 


Figure 


Kcoli Kl & E.coli EHEC strain EDL933 


SEQ ID NO: 2 


SEQ ID NO: 22 


3 


E.coli EPEC strain E2348/69 


SEQ ID NO: 7 


SEQ ID NO: 24 




E.coli EAEC strain 042 


SEQ ID NO: 8 


SEQ ID NO: 25 
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Extending the analysis to the K12 genome, the insertion site was found to be between two 
hypothetical open reading frames (YbbJ and YbbI) coded on opposite strands, and that the small 
"island" consists of three genes: an ORF coding for an hypothetical integral membrane protein, the 
gene for the putative NadA-like adhesin, and an ORF for a predicted lipoprotein of unknown 
function. The two latter ORFs are probably co-transcribed, while the first one is coded on the reverse 
orientation. A couple of 7-bp direct repeats (CTGACGC) that could represent putative insertion sites 
could be mapped at the boundaries of the inserted fragments (SEQ ID NO: 23, starting at nucleotides 
181 1 & 4255), and this repeat is absent in the vicinities of the point of insertion in the K12 strain. 

The length of the acquired DNA regions is 2348 bases for EPEC, 2450 bases for Kl and EHEC, and 
2630 for EAEC (Figure 18). In all cases, the G+C content of the fragment is lower if compared to the 
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flerage composition calculated for each genome, thus confirming the preliminary hypothesis that 
is segment has been acquired by pathogenic Rcoli by a mechanism of lateral transfer. 

In the case of uropathogenic E.coli (UPEC), a different DNA segment was found between the ybbJ 
ad ybbl genes. This segment is 1 342 bp long and encodes a predicted cytoplasmic *rotc?r, which is 
5 conserved only in Salmonella typlrymurium LT2, but absent from all the other analyzed strains of 
E.coli. Differently from the other described insertion fragments, no direct repeats could be mapped at 
the boundaries of this island, whose GC composition is also very similar to the average value. These 
data could indicate that the NadA-like encoding gene has been inserted later on in place of the c0608 
gene. Nevertheless, subsequent search revealed that a gene coding for an homologue of NadA could 
10 be found in a different location of the genome of uropathogenic E.coli strain CFT073. This protein is 
more distantly related to NadA and is seen as a member of a second NadA-like family of proteins. 
Counterparts of this protein are contained in the other pathogenic strains of E.coli e± analogous 
locations and, similarly to the first group of E.coli NadA-like molecules, the corresponding genes are 
also encoded on small islands and are not present in the K12 strain (Figure 19). Furthermore, these 
15 genes have strong similarities at the 3 f end with a frame-shifted Shigella flexneri sequence. The 
arrangement of NLM flanking regions has been compared in the two species {E.coli and Shigella) 
revealing striking similarities. Although the sequence conservation is restricted to the amino and 
carboxy-terminal portions of the adhesin coding genes, the flanking regions are syntenic and share 
more than 80% identity at the nucleotide level. Upstream of the NadA-like gene, this island contains 
20 an ORF coding for a lipoprotein that is frameshifted either in EPEC, EHEC and in Shigella. 
Furthermore, in the genome of Shigella, two additional genes (insA and insB), coding for transposase 
elements are found in the vicinities of the NLM gene. 
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E.coli UPEC strain CFT073 


SEQ ID NO: 10 


SEQ ID NO: 26 
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E.coli EHEC 


SEQ ID NO: 3 


SEQ ID NO: 27 


6 


E.coli EAEC 


SEQ ID NO: 9 


SEQ ID NO: 28 


7 


E.coli EPEC 


SEQ ID NO: 18 


SEQ ID NO: 3 C 


8 


S.flexneri 


SEQ ID NO: 1 1 


SEQ ID NO: 31 


9 



Haemophilus 

25 An incomplete NadA homolog was found in Brazilian purpuric fever (BPF) Haemophilus influenzae 
isolates {156}. This polypeptide has been named HadA. NadA and HadA align as follows: 

10 20 30 40 

HadA . pep MKRNLLKQS VIAVLIGGTTVSNYALAQAQAQAQVKKDELSELKKQVKEM- 

• • i i i • • • i • • l * • i ••• • 

• a ' * 1**1 ••• • 

NadA. pep KT VNENKQN VDAKVKAAE SE IEKLTTKLADT DAALADT DAALDETTNALNKLGEN ITT FA 

100 110 120 130 140 150 

50 60 70 80 90 100 

HadA . pep DAAIDGILDDNIAYEAEVDAKLDQHSAALGRHTNRLNNLKTIAEKAKGDSSEALDKIE 

-18- 




: : : | : : | | : | : : I : I : I : : : : I : : : I I : : I : : I I : I 
NadA . pep EETKTN I VKI DEKLEAVADT-VDKHAE AFNDI ADSLDETNTKADEAVKTANE AKQTAEET 

160 170 180 190 200 210 

110 120 130 140 150 160 

HadA. pep EEQNDEFLADITALEEGVDGLDDDIAGIQDNISD IEDDINQNSADIATNTAAIATH 

: : : | | : ! 1 : : | : II : : : I : : : : : I I I I I I I I I : 

NadA . pep KQNVD AKVKAAET AA- GKAE AAAGTANT AADKAE AVAAKVT D IKADI A*l 1 NKADI AKN 

220 230 240 250 260 270 

170 180 190 200 210 220 

HadA . pep T QRLDN LDNRVNNLNKDLKRGLAAQAALNGL FQP YNVGKLN LTAAVGG YKS QT AVAVG ... 

: I : I : II : I I I I : : : I I I I I I I : I I I I I I I I I :: I : II I I I I I I I :: I I I : I 
NadA. pep SARIDSLDKNVANLRKETRQGLAEQAALSGLFQPYNVGRFNVTAAVGGYKSESAVAIGTG 

280 290 300 310 320 330 

NadA. pep FRFTEN FAAKAGVAVGT S S G S S AAYHVGVN YEW 

340 350 360 

No HadA counterpart could be detected either in non-typeable H.influenzae strain 86028, which is 

responsible for otitis medic in children, cr in the non-pathogenic HJnflnenzae strain Rd KY/2C. The 
very high level of sequence identity between HadA and NadA in the C-terminal anchor region, might 
indicate a common origin. 

In order to analyze the origin of the hadA gene, the nucleotide sequence of this DNA region in the 
BPF isolate (SEQ ID NO: 20) was compared to the same region in'fhfe genome sequence for 
H. influenzae strains: the non-pathogenic strain Rd {157}, and a non-typeable 86028 strain (NTHi 
86028), associated with pediatric otitis media disease. 

The results of this comparison indicate that the adhesin coding gene is specific for the Brazilian 
Purpuric Fever clone (strain F3031), while no counterparts could be mapped either in the laboratory 
Rd or in the non-typeable strains. The HadA-encoding fragment has an organization that closely 
resembles that described for NadA {1} and includes an intact open reading frame plus a 182 bp 
upstream region, which contains —10 and —35 promoter elements. The small genetic island is flanked 
by the RNA helicase gene at the 5' end and by a putative protease encoding gene located at the 3 ! 
end. The GC composition of the recombined segment is consistent with the rest of the genome. 

In contrast, while tie ISZ Z.A 35028 slrdr. can be regarded as a totally negative st-rrrr. cs it T .r r-** 
whole region encompassing the RNA helicase and protease ORFs, the Rd genome contains at this 
location a DNA segment of 1 . 1 kb, which encodes two short ORFs of unknown function. This region 
is characterized by an abnormal GC content (32%) thus suggesting that an independent 
recombination event has taken place at this site. 

Additional NadA-like molecules were identified in other Haemophilus species, namely H.somnus, 
Hducreyi and Hactinomycetemcomitans (also known as Actinobacillus actinomycetemcomitans). 



Bacterium 


Amino acid 


Nucleic acid 


Figure 


H. influenzae biogroup aegyptius 


SEQ ID NO: 1 


SEQ ID NO: 20 


1 


H.somnus strain 129PT 


SEQ ID NO: 5 


SEQ ID NO: 21 


2 
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ducreyi 


SEQ ID NO: 6 






| H.actinomycetemcomitans 


SEQ ID NO: 4 







NadA and the H.actinomycetemcomitans sequence align as follows: 

10 20 30 40 50 

actac . pe MTYQLFKHHLVALMVTGAISVNALAKDSFLENPSANLPQQVFKNR — VD — IFNNETNI 

I : : : | | : I : I : II : I : : I 
NadA. pep TIYDIGEDGTITQKDATAADVEADDFKGLGLKKWTNLTKTVNENKQNVDAKVKAAESEI 

60 70 80 90 100 110 



60 70 80 90 100 110 

actac .pe NENKKDIAINK2\NIASIEKDVMRNTGGIDRI*AKQELVNRARITKNELDIRKNTKSIAENT 

: : : | : 1 : | : : : : : | : : : : : | : : : II : : I :| h 

NadA. pep EKLTTKLADT DAALADTDAALDETTNALNKLGEN ITTFAEETKTNIVKIDEKL 

120 130 140 150 160 170 



120 130 140 150 160 

actac . pe ASII'V-RIDGNLEGVNRVLQNVDVRSTE NAARSRANE — QKXAENKKAIENKA 

: : | : | : | : | : : : : I : I : I I : : I : 1 I : : : I I I : I 

NadA . pep EAVAUTVDKHAEAFNDIADSIiDETNTKADEAVKTANEAKQTAEETKQNVDAKVKAAETAA 

180 190 200 210 220 230 



170 180 190 200 210 220 

actac . pe DKADVEKNRADIAAN-SRAIAT-FRSSSQNIAALTTKVDRNTARIDRLDSRVNELDKEVK 

| | : : : I : | | : : : I : I : : : : I I : : : : I : II I I I t • I : i I I : : 
NadA . pep GKAEAAAGTANTAADKAEAVAAKVTDIKADIATNKADIA^ 

240 250 260 270 280 290 



230 240 250 260 270 280 

actac . pe NGIiASQAALSGLFQPYNVGSLNLSAAVGGYKSKTALAVGSGYRFNQNVAAKAGVAVSTN- 

: I 1 I I I I I I I I I I I I I II : I : : I I I I I I I I : : I : I : I - I : I I : : I I I I I I I M : I : 
NadA. pep QGLAEQAAL S GLFQPYNVGRFNVTAAVGGYKSE S AVAIGTGFRFTENFAAKAGVAVGT S S 

300 310 320 330 340 350 



290 

actac.pe GGSATYNVGLNFEW 

I : I I : I : I I : I : I I 
NadA. pep GSSAAYHVGVNYEW 

3 60 



NadA and the Hsomnus sequence align as follows: 



37.0% identity in 284 aa overlap 



90 100 110 120 130 140 

H . somnus . pep EVIKGWNEVKSLPRIDGNGKDKQTKDQIAMLIRTVDNTKELGRIVSTNIEDIKNLKKELY 

| ||: ::::::! { : I : 

NadA. pep MSMKHFPSKVLTTAILATFCSGALAATSDD — D VKKAAT VAI VAAYLvi X G Q el 1 N 

10 20 30 40 50 

150 160 170 180 190 

H . somnus . pep GF VEDVNES EARNISRIDENEKDIKNL — KKELYDFVEDVNESEARNISRID 

|| : I : : I : : : : I : I : I : I | | : : : : : I I I : : : : : 

NadA . pep GFKAGET I YDIGEDGT ITQKDATAADVEADDFKGLGLKKWTNLTKT VNENKQNVDAKVK 

60 70 80 90 100 110 



200 210 220 230 240 250 

H . somnus . pep ENEKDINTLK-ELMDED — LNS VLTQIE DVKLTFQDVNDNVNLAFEE INGNAQK FDTAIE 

| : : I : I : I I I I :::::: : :::::: I : : I I : I I : I : I 
NadA. pep AAESEIEKLTTKEiADT DAALADTDAALDETTNALNKLGEN ITT FAEETKTNIVKI DEKLE 

120 130 140 150 160 170 

260 270 280 290 300 310 

H . somnus . pep GLTSGLSDLQAKVDANKQETEDDIADNAKAIHSNTKGIAKNTKDIRDLDTKTKQMLENDK 

: : : : I I : I : : I I I I : : : : : : I : : : : : : : It I 
NadA. pep AVAD TVDKHA-EAFNDIADSLDETNTKADEAVKTANEAKQTAEETKQ 
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180 



190 



200 



210 



320 330 340 350 . 360 370 

H . somnus . pep NLMTGLESLATETSKGFERFDVKTQQLDQAV 

I : : : : : 1 : : I : : : I : I I I : : : 1 : I | 1 : : : : I : I 

NadA . pep NVDAKVKAAETAAGKAEAAAGTANTAADKAEA-VAAKVTDIKADIATNKADIAK^ 
220 230 240 250 260 270 



380 390 400 410 420 

H . somnus . pep TLDKN TKAGIASAVALGMLPQSTAPGKSLVSLGVGHHRGQSATAIGVSSMSSN 

: I 1 I I 1 : I : I : I I : I I 1 : I : : I I : : : : I I : I 1 I : : : : 

NadA. pep SLDKOTANLRKETRQGIAEQAALSGLFQPYNVGRFNVTAAVGGYKSESAVAIG-TGFRFT 

280 290 300 310 320 330 



430 440 450 

H. somnus . pep GKWWKGGMSYDTQRHATFGGSVGFFFN 



• • • . i • t • • i 



1 1 



NadA. pep EN FAAKAGVAVGT S S GS S AAYHVGVN YEW 

340 350 360 

NadA and the H. ducreyi sequence align as follows: 



23.2% identity in 354 aa overlap 



150 160 170 180 • 190 200 

H. ducreyi .pe SKNKQNIDTISBCYLLELGTYLDGSYRMMEQNTHNINKNTHNINKNTHNINKLSKELQTG1, 

I : I I I : I : : : I : : ! I I : 1 I 

NadA . pep EAAAGTANTAADKAEAVAAKVTDIKADIATNKADIAKNSARIDSLDKNVANLRKETRQGL 

M0 250 260 270 280 290 

210 220 230 240 250 260 

H . ducreyi . pe ANQSALSMLVQPNGVGKTSVSAAVGGYRDKTALAIGVGSRITDRFTAKAGVAFNTYNGG- 

1 : I : I I I i II : I I : : I : I I I I I ! : : : : I : II I : I I : I : I : I ! I I I I : I *• I : 
NadA. pep AEQAALSGLFQPYNVGRFNVTAAVGGYKSESAVAIGTGFRFTENFAAKAGVAVGTSSGSS 

300 310 320 330 340 350 

270 

H. ducreyi. pe MSYGASVGYEF 

: 1 : ; I : I I : 
NadA. pep AAYHVGVN YEW 

360 

47.5% identity in 101 aa overlap 

NB: the coiled-coil prediction for the H. ducreyi polypeptide is not high. 



Other bacteria 

Further NadA homologs identified in the search are: 
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Brucella melitensis 


SEQ ID NO: 12 


SEQ ID NO: 32 


10 


Brucella suis 


SEQ ID NO: 13 


SEQ ID NO: 33 


11 


Ralstonia solanacearum 


SEQ ID NO: 14 


SEQ ID NO: 34 


12 


Sinorhizobium meliloti 


SEQ ID NO: 15 


SEQ ID NO: 35 


13 


Bradorhizobium japonicum 


SEQ ID NO: 16 


SEQ ID NO: 36 


14 


Burkholderia fungorum 


SEQ ID NO: 17 


SEQ ID NO: 29 


15 



Multiple sequence alignment 

A multiple sequence alignment of members of the NadA "family" is below: 

10 20 30 40 50 60 
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i I I I I I I 

'961_HI MKRNLLKQSVIAVLIGGTTVSN 

9 6 1__ACTAC MT YQLFKHHLVALMVTGAI SVNAL 

MenB NadA -MSMKHFPSKVLTTAII1ATFCSGAI1AATSDDDVKK AATVAIVAAYNNGQEINGFKAG 

YADA__YEREN — MTKDFKI SVSAALI S ALFSSP YAFADDYDGI PN LTAVQISPNADPALGLEYPVRP 

961_HAESO MKKVQFFKYSSLALALGLGVSASALAAPTSTSTTTGPEAPPTGPAPTAKDPLAETALAYD 

9 6 1_K1 MKT VNV ALLALI I S AT SS P WL AGDT I EAAAT 

9 61_HAEDU MKIKCLVAV VGLACST ITTMAQQP 

Prim. cons. M23MK42K22LLA2AI2A2FS2GALAA2T6D444TGPEA33V3I3P3A333L33333333 

70 80 90 100 110 120 

I I I I I I 

961_HI YALAQAQAQAQVKKD 

961_ACTAC AKDSFLENPSANLPQQVFKNR VDIFNNET 

MenB NadA ET I YD I GE DGT ITQKDAT AADVEAD DFKGLGIiKKVVTNLTK 

Y ADA_YE REN PVPGAGGLNASAKGIHSIAIGATAEAAKGAAVAVGAGSIATGVNSV AIG 

961_HAESO LENE VAYLRMKAGEWMQLGLDPEKEVI KGWNEYKSLPRI DGNGKDKQTKDQI AMLI RTVD 

961_K1 EL SAINS GMSQS E I EQKITRFLERT DNS PAAYT 

961_HAEDU PKFAGVSSLYSYEYDYGKGK 

Prim. cons. 333333GL4A2A6677SS2ADAEA3VFKGL444255PNI5T22222QTKDQIAMLIR222 

130 140 150 160 170 180 

I I i i 1 I 

961_HI ELSELKKQVKEMDAAIDGILDDNIAYEAEVDAKLDQHSAALGRHTNRLNNLKT 

961_ACTAC NINENKKDIAINKANIASIEKDVMRNTGGIDRLAKQELVNRARITKNELDIR 

MenB NadA TVNENKQNVDZVKVKAAESEIEKLTTKLADTDAALADTDAALDETTNALNKLGE 

YADA_YEREN PLSKALGDSAVTYGAASTAQKDGVAIGARASTSDTGVAVGFNSKADAKNSVAIGHSSHVA 

961_HAESO NTKELGRIVSTNIEDIKNLKKELYGFVEDVNESEARNISRIDENEKDIKNLKK 

961__Kl YLTEHHYIPSETPDTTQTPPVQTDPDAGQKTVAATGWQIPARYQSMINARQS 

961_HAEDU WTWSNEGGFDIKVPGIKMKPKEWISKQATYLELQHYMPYTPVLVTSAPDVSPS 

Prim, cons . NL2ENK22V323VAAIK2IPKDLIAK7ADVD23^22V72A22R7T3A2NNLKSGHSSHVA 

190 200 210 220 230 240 

1 I I 1 I I 

9 6 1_HI 1 AEKAKGDS SEALDKI EALEEQNDE- 

9 6 1_ACTAC KNTKSIAENTASIARI DGNLEGVNR- 

MenB NadA NITTFAEETKTNIVKI DEKLEAVADT 

YADA_YEREN ANHGYS I AI GDRSKTDREN S V S I GHESLNRQLTHLAAGTKDTDAVN V AQLKKE I EKTQEN 

96l_HAESO ELYDFVEDVNESEARNISRXDENEKDINTL 

9 6 1_K1 AVTDAQQTQITEQQAQI VATQKTLAAT 

9 6 1_HAEDU S I S I LLYPMS DP DQLG INRQQLKM 

Prim, cons . ANHGYSIAIGDRSKTDRENSVSIGHESLNR2L23 6A2K7KEE72ENIAQID2N2EQ22E2 



250 260 270 280 290 300 

I I I I 1 I 

961_HI FLADITALEEG VDGLDDDIAGIQDN 

9 61_ACTAC VLQNVDVRSTENAA RSRANEQKIAENKKA 

MenB NadA VDKHAEAFNDIADSLDETNTKADEAVK TANEAKQTAEETKQN 

YADA_YEREN TNKRS AELLANANAYADNKSSSV-LGIANNYTDSKSAETLENARKEAFAQSKDV 

9 61_HAESO KELMDEDLNSVLTQIEDVKLTFQDVNDNVNLAFEEINGNAQKFDTAIEGLTSGLSDLQAK 

961_ Kl GDTQN TAHYQEMINARLAAQNEAN QRTTTEQGQKMNALTTD 

961JHAEDU IiYSYFI'TDIjP'IDFK LKVLDARI SKNKQN 

Prim. cons. 4DK44E22N34257LA22227A225A52VNL2222222222223TT7N3L2QKIAE2K2N 



310 320 330 340 350 360 

i I I I I I 

961_HI ISDIED DINQNSADIATNTAAIATH 

9 6 1_ACTAC IENKADKA DVEKNRADIAANSRAIATF 

MenB NadA VDAKVKAA ETAAGKAEAAAGTANTAAD 

YADA_YEREN LNMAKAH S NS VARTTL ET AEEHAN S VAR TTLETAEEHANKKSAEALASANVYADS 

961_HAESO VDANKQETE DDI ADNAKAI HS NT KGI AKNTKDI RDLDTKTKQMLENDKNLMTGLE S LATE 

9 6 1_K1 VAAQQQKE RAQYDKQMQSLAQKS VQAHE 

961_HAEDU IDTISK YLLELGTYLDGSYRMMEQN 

Prim, cons . 2DA2K3KA222222222222222222A2NTKDI22L2T223D722NSA23AA3T22IATE 



370 380 390 400 410 420 

I t I 1 1 I 

961_HI TQRLDNLDNRVNNLNKDLKRGLAA 

9 6 1_ACTAC RSS SQ NIAALTTKVDR NTARIDRLDSRVNELDKEVKNGLAS 

MenB NadA KAE AV AAKVT DI KAD I ATNKAD I AK NSARIDSLDKNVANLRKETRQGLAE 

YADA YEREN KSS HTIiKTANSYTDVTVSNSTKKAIRESNQ-YTDHKFRQLDNRLDKLDTRVDKGLAS 
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961_HAESO 

961_K1 

961_HAEDU 

Prim, cons . 



T SKGFERFDVKTQQLDQAVANVVGRVD I TEQ AI RQNT AGLVNVNKRVDTLDKNTKAGI AS 

QIES LRQDSAQTQQQIiTNTQKRVADNSQQINTLNNHFDSLKNEVEDNRKEANAGTAS 

THN IN KNTHNINKNTHNINKLSKELQTGLAN 

2S22FE4544K44Q44Q5IANN6T2VAI3EQ3I24NTARID2LDNRVN2LDKE3KAGLAS 



961 HI 

961_ACTAC 

MenB NadA 

YADA__YEREN 

961_HAESO 

961_K1 

961_HAEDU 

Prim, cons . 



430 440 450 460 470 480 

I I I I I I 

QAALNGLFQPYNVGKLNLTAAVGGYKSQTAVAVG 

QAALSGLFQPYNVGSLNLSAAVGGYKSKTAIiAVGSG-YRFNQNVAAKAGVAVST-N-GGS 

QAALSGLFQPYNVGRFNVTAAVGGYKSESAVAIGTG-FRFTENFAAKAGVAVGTSS-GSS 

SAALNSLFQPYGVGKVNFTAGVGGYRSSQALAIGSG-YRVNENVALKAGVAYAG SSD 

AVALGMLPQSTAPGKSLVSLGVGHHRGQSATAIGVSSMSSNGKWVVKGGMSYDTQR-HAT 
AIAIASQPQVKTGDVMMVSAGAGTFNGESAVSVGTS-FNAGTHTVLKAGISADTQS-DFG 
QSALSMLVQPNGVGKTSVSAAVGGYRDKTALAIGVG-SRITDRFTAKAGVAFNTYNGGMS 

*: * ..* .... * ::* : 

QAAIjSGLFQPYI!IVGKI 1 NVSAAVGGY2S32A2AIG3GS2RFNEN2AAKAGVA2DTQ2GGSS 



961_HI 

961_ACTAC 

MenB NadA 

YADA__YEREN 

961_HAESO 

961_K1 

961_HAEDU 

Prim, cons . 



490 
I 



AT YNVGLN FEW 
AAYHVGVNYEW 
VMYNASFNIEW 
— FGGSVGFFFN 
— AGVGVGYSF 
— YGASVGYEF 

* mm % m 

* * ♦ * * 

AG Y 2 VG VN FEW 



It will be understood that the invention has been described by way of example only and modifications 
may be made whilst remaining within the scope and spirit of the invention. 
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^QUENCE LISTING 

SEQ ID NO: 1 (Haemophilus aegyptius) 

MKRNLLKQSVIAVLIGGTTVSNYALAQAQAQAQVKKD^ 

KTIAEKAKGDSSEALDKIEALEEQNDEFIADITALEEGVDGLDDDIAGIQDNISDIEDDINQNSADIATNTMIATHTQRLDNLDNRV 

NNLNKDLKRGLMQAALKGLFQPYNVGKLNLTAAVGGYKSQTAVAVG 

SEQ ID NO: 2 (Escherichia coli) 

MKTVNVALLALIISATSSPWLAGDTIEAAATELSAINSGMSQSEIEQKITRFLERTDNSPAAYTYLTEHHYIPSETPDTTQTPTVQT 
DPDAGQKTVMTGDVQTTMYQSMINARQSAVTDAQQTQITEQQAQIVATQKTLAATGDTQNTAHYQEMINARLAAQNEANQRTATEQ 
GQKMNZVLTTDVAVQQQNERTQYDKQMQSLAQESAQAHEQIDSLSQDVTQTHQQLTNTQKRVADNSQQINTLNNHFSSLKNEVDDNR^ 
AMGTASAIAIASQPQVKTGDVMMVSAGAGTFNGESAVSVGTSFNAGTHTVLKAGISADTQSDFGAGVGVGYSF 

SEQ ID NO: 3 (EHEC) 

MNKI FKVIWNPATGNYTVTSETAKSRGKKSGRSKLLI S ALVAGGMLS S FGALAN AGN DNGQGVDYGSGSAGDGWVAIGKGAKANTFMNTSGS STAVG 
YDAIAEGQYSSAIGSKTHAIGGASMAFGVSAISEGDRSIALGASSYSLGQYSMALGRYSKAL^ 

GDTANASKAYSMALGAS SVASEENAI AIGAETEAAENATAI GNNAKAKGTNSMAMGFGSLADKVNTIALGNGSQALADNAI AI GQGNKADGVDAIAL 
GNGSQSRGLNTIALGTASNATGDKSLALGSNSSANGINSVAL^ 

GGGAAVDVDDGTVTAPT YlftiKNGSKNNVGAAM S PTASVITDVADGT I SASSKDAWGSQLKATNDDVEANTAN 

IATNTSNIATNTMIATNTTNITNLTDSVGDLQAD^ 

SNIATNTTNISNLTEWTNLGEDALKWDKDNGVFTAAHGTO 

LGEDALKWDKDNGVFTAARGtmTASKITNI^^ 

ST S LGDALLWDATAGKFSAKHGTNGDASVI TDVADGEI SDS SSDAVNGSQLHGV S S YWDALGGGAEVNADGTITAPT YT I ANAD YDNVGDALNAI D 
TTLDDALLWDADAGENGAFSAAHGKDKTASVITNVANGAISAASSDAINGSQLYTTNKYIADALGGDAEVNADGTITAPTYTIANAEYNIT7 

LDDNALLWDETMGGAGAYNASHDGKASIITNVMGSISEDSTDAVNGSQ 

DASAQGVGATAIGYNSVAKGDSSVAIGQGSYSDVDTGIALGSSSVSSRVIAKGSRDTSITENGWIGYDTTDGELLGALSIGDDGKYRQIINVADGS 

EAHDAVTVRQLQNAIGAVATTPTKYFHMSTEEDSLAVGTDSLAMGAKTIVNGDKGIGIGYGAYVDMALNGIM 

GAQTNYTAYMDAPQNSVGEFSVGSADGQRQITNVAAGSADTDA^ 

KTNTDGVDASAQGKDSVAIGSGSIAAADNS VALGTGSVATEENTI S VGS STNQRRITNVAAGKNATDAVNVAQLKS S EAGGVRYDTKADGS I DYSNI 

TLGGGNGGTTRISNVSAGVNNNDVVNYAQLKQSVQETKQYTDQR1WEMDNKLSKTESKLSGGIASAMAM 

LGVSMVSANGRWVYKLQGSTNSQGEYSAALGAGIQW 

5 SEQ ID NO: 4 (Actinobacillus actinomycetemcomitans) 

MTYQLFKHHLVAIWTGAISVNALMDSFLENPSANLPQQVFKNRVDIFNNETNINEMKDIAINKMIASIEKDVM 
QELVNRARITKNELDIRKNTKSIAENTASIARIDGNLEGV^ 

AANSRAIATFRSSSQNIAALTTKVDRNTARIDRLDSRVNELDKEVKNGLASQAALSGLFQPYNVGSLNLSAAVGGYKSKTALAVGSGY 
RFNQNVAAKAGVAVS TNGGSAT YNVGLNFEW 

SEQ ID NO: 5 (Haemophilus somnus) 

MKKVQFFKYSSLALALGLGVSASALAAPTSTSTTTGPEAPPTGPAPTAKDPLAETALAYDLENEVAYLRMKAGEWMQLGLDPEKEVIK 
GWNEVKSLPRIDGNGKDKQTKDQIAMLIRTVDNTKELGRIVSTNIEDIKNLKKELYGFVEDVNESEARNISRIDENEKDIKNLKKELY 
DFVEDVNESEARNISRIDENEKDINTLKELMDEDLNSVLTQIEDVKLTFQDVNDNVNLAFEEINGNAQKFDTAIEGLTSGLSDLQAKV 
DANKQETEDDIADNAKAIHSNTKGIAKNTKDIRDLDTKTKQMLENDKNLMTGLESLATETSKGFERFDVKTQQLDQAVANVVGRVDIT 

EQAIRQNTAGLVIWNKRVDTLDKNTKAGIA 
ATFGGSVGFFFN 

SEQ ID NO: 6 (Haemophilus ducreyi) 

MKIKCLVAWGLACSTITTMAQQPPKFAGVSSLYSYEYDYGKGKWTWSNEGGFDIKVPGIKMKPKEWISKQATYLELQH 
TSAPDVSPSSISILLYPMSDPDQLGINRQQLKLNLYSYFNDLRHDFKLKVLDARISKNKQNIDTISKYLLEl^TYLDGSYRMMEQNTH 

NINKNTHNINKNTHNINKLSKELQTC 
YGASVGYEF 

SEQ ID NO: 7 (EPEC) 

MKTVNVALLALIISATSSPFVLAGDTIEAMTELSAINSGMSQSEIEQKITRFLERTDNSP2\AYTYLTEHH 

DPDAGQKWMTGDVQTTARYQSMINARQSTVTDAQQTQ^ 

GQKMNALTTDVAAQQQKERAQYDKQMQSLAQ 

AMGTASMAIASQPQVKTGDLMIOTSAGAGT^ 
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^Q IDNO: 8 (EAEQ 

MKTVKLSLUVVWMMSPSAFAGDTVEMTTELTVIQPGMSQSEIDQKIGRFLERTGNSVMQNYLIMDYQTTTPQENTMSPVQP 

TNTLNPITNQAQTDRDNGQDTAIQDAQH^^ 

MMMTALINGVKLSGAVTENKNNIEQNRSDI^ 

NKMITDIRNDVSAVQSDVTNIKGDVAMQSTADHMANMTALMNGVKLSSAVTENKNNIEQNRSDIADQQKLLASNE 
QDTAIQDAQHMNWASMKADDAQHAITVAQ^ 

VTENKNNIEQNRSDIADQQQQLDETRKIVAATGDVQTAARYQSMIDARQTAAANAQQAQADTQQQQMDDQQKQIDATQKTVSALGDAQ 
TNAHYQEMVNAGLRAQNDANMTAAEQKQKIDTMTNQATQQHINSVQYGEQIQRLAQDSTQTHEQIDSLTQDVTQTHQQLSNTQKRV 
ADNSQQITTLNNHFSSLKNEVEDNRKEANAGTASAIAIASQPQVKAGDFMMMSAGAGTFNGESAVSVGTSFNAGTHTVIKAGVSADTQ 
SDFGAGVGVGYSF 

SEQ W NO: 9 (EAEQ 

• 

MMIFKVIWNPATGSYTVASETMSRGKKSGRSKLM 

GGGSSTAVGYHSTADGRWSTALGAKTHSLGEASVALGINTTSAGERSLAIGASATSTGGFSIALGRYANSVGEFSIAQGDHAETGADD 
AIAFGRESKALGIMSIALGATMASKEYAMALGASSAASAMA 

TNAEMGLNSIALGSGSIADVDNTIALGNQSQAVMGAIAIGQGNKADGMAIALGNGSITGGVNAIALGQGSYAGLENGTAIGAQAS 

AQGKNSVALGAGSVATDADTVSVGNTTAQRQIVNMAAGDISTTSTDAINGSQLYAISKSVADNLGGGATVNAQGWTSPNYRLKSGIF 

GTVGDALTGLDMTLQWDSLKKAYSAMGTDTTSTITNVKDGAISDTSKDAVNGSQLKTTNDWATNTANITTNTNSINTLTDSVGDL 

KDDALLWNGTAFSAAHGTEATSKITNVKDGDL^^ 

ATSKITNVKDGDLTAGSTDAVNGSQLKTTNDAVAA^ 

GDLTAGSTDAVNGSQLKTTNDAVAANTTNIATNTTNITNLTDAVDSLGDDSLLWNATAGAFSAMGTDATSKITNVKDGDLTAGSTDA 

WGSQLKTTNDAVAANTTNIATNTTNITNLTDAVDSLGDDSLLWNATAGAFSAKHGTNGTDSKITNLLAGTVSSDSTDAI 

ADSFTSYLGGGADISDAGVLTGPTYTIGGTDYNNVGDALAAINTSFSTSLGDALLWDATAKGGDGAFSAGRGTDNTASIITNVADGAI 

SSTSSDAINGSQLYDTSKYIADTLGGDAEVNADGTITAPTYAIAGGSYSNVGDALEAIDTTLDDALLWDATANDGNGAFSAAHGKDKT 

ASVITNVANGAISATSSDAINGSQLYTTNKYIADALGGDAEVNADGSITAPTYTIANMYNNVGDALDALDDNALLWDATANDGAGAY 

NASHDGKASIITOTADGNIGEGSTDAINGSQLFNTMLIQQNSEIINQLAGNTSETYIEDNGAGINYVRTNDNGMFNDASASGIGAT 

AVGYNAVASGESSVAIGQGSSSNVDTGIALGSSSVSSRVIVKGSRDTSVSEEGWIGYDTTDGELLGALSIGDDGKYRQIINVADGSE 

AHDAVTVRQLQNAIGAVATTPTKYFHANSTEEDSLAVGEDSI^ 

GNGSQTTRGAQTGYAAYNMDAPQNS VGE FS VG S E DGQRQI TNVAAGS ADT DAVNVGQLKVT DAQVSQNTQS I TNLNNQVTNLDTRVTN 
IENGIGDIVTTGSTKYFKTNTDGVDANAQGKDSVAIGSGSIAAADNSVALGTGSVANEENTISVGSSTNQRRITNVAAGVNATDAVNV 
SQLKSSEAGGVRYDTKADGSVDYSNITLGGGNGGTTRISNVSAGVNNNDAWYAQLKQSVQETKQYTDQRMVEMDNKLSKTE 
IASAMAMTGLPQAYTPGASMASIGGGTYNGESAVALGVSMVSANGRWVYKLQGSTNSQGEYSAALGAGIQW 

SEQ ID NO: 10 (UPEQ 

MNKIFKYIWNPATGSYTVASETAKSRGKKSGRSKLLISALVAGGLLSSFGASADNYTGQPTDYGDGSAGDGWVAIGKGAKANTFMNTS 

GASTALGYDAIAEGEYSSAIGSKTLATGGASMAFGVSAKAMGDRSVALGASSVANGDRSMAFGRYAKTNGFTSLAIGDS 

ALGNTAKAYEIMSIALGDNANASKEYAMALGASSKAGGADSLAFGRKSTMSTGSLAIGADSSSSNDNAIAIGNKT^ 

ASQASGESSIALGNTSEASEQNAIALGQGSIASKVNSIALGSNSLSSGENAIALGEGSAAGGSNSLAFGSQSRANGNDSVAIGVGAAA 

ATDNSVAIGAGSTTDASNTVSVGNSATKRKIV^^yIMGAISNTSTDAINGSQLYTISDSVAKRLGGGATVGSDGTVTAVSYALRSGTYN 

NVGDALSGIDNNTLQWNKTAGAFSMHGANATNKITNVAKGTVSATSTDWNGSQLYDLQQDALLWNGTAFSMHGTEATSKITNVTA 

GNLTAGSTDAVNGSQLKTTNDNVTTNCT^ 

VNGSQLKTTNDNVTTNTTNIATNTTNITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVTAGNLTAGSTDAVNGSQLKTTN 
DNVTTNTTNIATNTTNITNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVtCAGDLTAGSTDAVNGSQLKTTNDNVSTNTTNI 
TNLTDAVNGLGDDSLLWNKTAGAFSAAHGTDATSKITNVKAGDLTAGSTDAVNGSQLKTTNDNVSTNTTNITNLTDSVGDLKDDSLLW 
NKMGAFSAAHGTEATSKITNLLAGKISSNSTD^^ 

FSTSLGDALLWDATAGKFSAKHGINNAPSVITDVANGAVSSTSSDAINGSQLYGVSDYIADALGGNAWNTDGSITTPTYAIAGGSYN 

NVGDALEAIDTTLDDALLWDTTMGGNGAFSAAHGKDKTASVITNVMGAVSATSNDAINGSQLYSTNKYIADALGGDAEVNAD 

APTYTIANTDYNNVGEALDALD^ALLWDEDAGAYNASHDGNASKITNVAAGDLSTTSTDAWGSQLNATNILVTQN^ 

SETYIEENGAGINYVRTNDSGLAFNDASASGIGATAVGYNAVASHASSVAIGQDSISEVDTGIALGSSSVSSRVIVKGTRNTSVSEEG 

WIGYDTTDGELLGALSIGDDGKYRQIIIWADGSEMDA^ 

AGIGIGLNTLVLADAINGIAIGSNARM^ 

NVGQLKVTDAQVSQNTQSITNLNTQVTNLDTRVTNIENGIGDIVTTGSTKYFKTNTDGADANAQGKDSVAIGSGSIAAADNSVALGTG 
SVADEENT-ISVGSSTNQRRITNVMGVNATDAVNVSQL^ 

AQLKQSVQETKQYTIXJRMVEMDNKLSKTESKLSGGIASAMMTGLPQAYTPGASMASIGGGTYNGE 
STNSQGEYSAALGAGIQW 
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^QWNO: 11 (Shigella flexneri) 

MTNLGEDALKWDKDNGVFTAAHGTETTSKIT^ 

WDKDNGVFTMHGNNTASKITNILDGTVTATSSDAINGSQLYDLSSNIATYFGGNASWTDGVFTGPTYKIGETNYYNVGDALMINS 
SFSTSLGDALLWDATAGKFSAKHGTNGDASVITDVADGEISDSSSDAVNGSQLHGVSSYWDALGGGAEVNADGTITAPTYTIANADY 
DNVGDALNAI DTTPDDALLWDADAGENGAFSAAHGKDKTASVITNVANGAISAASSDAINGSQLYTTNKYIADALGGDAEVNADGTIT 
APTYTIMAEYNNVGDALDALDDNALLWDKTANGGAGAYNASHDGKASI^ 

AGil T DAT 1 I hiiiN G AGI N YVRT 'A DiiAF H DAS AS G VGAT AVG YN AVAS GAS S VAI GQN S S S TV DTG 1 ALG S S S VS S RVI AKGS RDT S V 
TENGWIGYDTTDGELLGMiSIGDDGKYRQIINTO^ 

VNGNAGIGIGLNTLVLADAINGIAIGSNARANHANSIAMGNGSQTTRGAQTGYTAYNMDAPQNSVGEFSVGSEDGQRQITNVAAGSAD 

TDAVNVGQLKVTDERVAQNTQSITNLNNQVTNLDTRVTNIENGIGDIVTTGSTKYFKTNTDGVDANAQGKDSVAIGSGSIAAADNSVA 

LGTGSVAEEENTISVGSSTNQRRITNVAASVNATDAVNVSQLKSSEAGGVRYDTKADGSIDYSNITLGGGNGSTTRISNVSAGVNNND 

AVNYAQLKQSAQETKQYTDQRMVEMDNKLSKTE 

KLQGSTNSQGEYSAALGAGIQW 

SEQ ID NO: 12 (Brucella melitensis) 

MSFFKKNISITMGGLMLSLAVDAAKAEENVSQVKLPPVFVFELVENQGLMIALIRPRVIAPDNNLRP 

LISENRQVINNNTTAIGQNRTSISTNAKGVADNRMIRQNSMISALGQRVDGLQGQINSMKEAMGAANAAALSGLRYDNRPGKVS 
IATGVGGFKGSTALAAGIGYTSKNENARYNVSVAYNEAGTSWNAGASFTLN 

SEQ ID NO: 13 (Brucella suis) 

MSFFKKNISITMGGLMLSLAVDAAKAEEtWSQVKLPPV 
LISENRQVINMTTAIGQNSDRIDMAKGVADNRMIGQ 
NAKGVADNRAAISQNRGRINMMGVASNRAAIRQNSAAISA^ 
VGG FKGST ALAAGI G YT S KNEN ARYN V SV AYNE AGT S WN AGAS FT LN 

SEQ ID NO: 14 (Ralstonia solanacearum) 

MVFSMPQYACAEMLLQNDPGTNCGSVGDAYAWARGDGYSGCKVGYEAAK^ 

LDSMNIGGHLDVWGASGFHGGVDMNNSAIKNLADGTLSATSTEAVTGRQLNATNTNITNLQNSIKSISSSASLVQQSAAGKDITVAKD 
LDGDAVDFSGKKLSDSTTFSRKLTGVAEGTLSATSTDAVSGKQLYTTNQ^ 

ISGGSAGLVQQSAAGKDITVAKDLDGEAVDFSGKKLSDSTTFSRKLTGVAEGTLSATSTDAVSGKQLYTTNQNLASTNKDLANTNTRL 

TTAEGNLSSNTTSITNLQNTIKNISGGSAGLVQQSAAGKDITVAKDLDGDAVDFSGKKLSDSTTFSRKLTGVAEGTLSATSTDAVSGR 

QLYTTNQNLSTTNQNLADTNKSLAETNKtWSATTTNITN^ 

SRKLTGVAEGTLSATSTDAVSGKQLYTTNQNLSTTNQN^ 

VMNLDGDAVDFSGKKLSDSTTFSRKLTGVAEGTLSATSTDAVSGKQLYTTNQNLASTNKDLANTNTRLTT7VEGNLSSNTTSITNLQN 

TIKNISGGSAGLVQQSAAGKDITVAKDLDGDAVDFSGKNLSDSTTFSRKLTGVAEGTLSATSTDAVSGKQLYTTNQNLSTTNQNLADT 

NKSLMTMNVSATTTNITNLQNTVNNISSGSAGLVQQSAAGKDITVMDLDGDAVDFSGKKLSDSTTFSRKLTGVAEGTLSATSTD 

VSGKQLYATNQWSKLSMVTDVSDSVTNIKNTMNTIVNGGGLKYFHMSTLDDAQMGLESIAFGGAAVAAGMNSMAMGGNARAVAG 

NAVALGAGSVADRANTVSVGSAGKERQITNVAAGTADTDAVNVAQLE(AAGIINGSGRTNATVTYGTNADGSADYGNVTLGGGNAPAGT 

AIHNVMGTMTDA\nWRQMNAAIASVQKVSNTNDPMFMDGDRAVKRASAKGTHATMGAAASAGGDQSVA 

MAKATMHAVAVGSGSVMRMTMSVGSAGSERQITOTAAGVQGTDAVNVSQLSQAVYAAVGDLPAGTTARQYTDEQIGMVRQGISQ 

VARGAYSGIAAATALTMIPDVDQGKSIAIGIGSATYKGYQAVALGASARISHNLKAKMGVGYSSEGTTVGMGASYQW 

5 SEQ ID NO: 15 (SinorMzoblumi meliloti) 

MALGRQSVSAGSGSIAFGNGSYMSNGSVAIGQSAYAANVRAIAIGGDDAFAW 

EADPGGASDAIAIGTDAQANGDRSLAIGRQNQAGNEQSIGIGAGNTATGKLSIGIGSSNVASGEQSLSLGAGNNALGQGSISIGTETT 

AGGLRS I AFGVRASTKEANLDI P DDVAAI DAI AIGTNTKANGDRSVS IGTGSQASSGAVS IGDAAKAVGDKSVS IGTESWADGDESVS 

IGLWNAGFEGNDRIKGGQTSVSLGAFNQSPGIEAIAIGARNEANADRSIAIGSRAKTiCAADPAQADGGARDAVAIGTDALANDDRSI 

SIGWNSSTSLNDSISIGTRATSGSAGDIMIGTGSGTGSTSGQNNVALGVAASQKVKGSSNIAIGDSAGGSREGDNNVAIGTNAGIQFS 

ESEHETAVRADLWSDAVSIGNEALASADEAIAIGTGAVASGLKSISIGVGNTVSGASSGAIGDPTDITGTGSYSLGNDNTIAADNAG 

TFGNDNTLADAADGSRVIGNGNNIDVSDAFVLGNGADVTEVGGVALGSGSVSDTGADVAGYVPGGASTADQNAIEATQSTRGAVAVGN 

PDAETGVYRQITGVAAGTADSDAAWAQLKSVETIMTGWKLTTDSGSI 

ttTCVTMDPDTGASTVLDENGLSFTTQDANGEDTALGPRVT^ 

DMTNGVIDEGELNLDSVTLAGGMGGTRISNLAPGALSA^ 

YSNVGDAIJ^VGGTAGAGWSLSAQGANASWAPGETVDLRSGDGNI 

GGSIVIGSTMLGSNGLVITGGPSVTTDGIDAGGMKVTNVANGTV^ 

ITIWMGDLNMSTDAWGSQLYATNVKVDRLDTEVKEIDSRVTYIESFQ 
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^KpOlTDAVNVGQLDESVAESKSYTDEKT 

^RPGKLSVMiGGGFWRSEGMiAFGAGYTSEDGRVRMLTGAMGGNVGVGAGLSITLN 
SEQID NO: 16 (Bradorhizobhim japonicum) 

MRAFGSGNAINGTNYMVGSNNWAGMGAWGSGNGVTGDOT^ 
GTGQmGNGTFAIGDPNIVNGNNSLVFGDNNTTOGS^^^ 

G2i3S VAI GI1GTAVTG I NAI AMGTGAGAN FDHS VAX G S GAT T T PJ\iTOVAVGT:i\S ST YT1-ISG I T SMSX/^\QSG P TOT : V I ^ T.. ,2\G T v(- ATT 

SLAGLGLASAGDINGINSQLMLNGRVDNL^ 

FTAAQQGAIGVRAGASWTLN 

SEQ IB NO: 1 7 (Burkholderia fungorum) 
MNKTYRSVWIIESTGTW^^ 

SDYQWGLFSOTNADGSKSGQPIGMIEGMNDGSLLLYGPM 

TVMITTGAGIMYFHVNSTMDSTMGVNSIAIGGATRTDMNSISIGTGLTQASSNTGAIAIGQNASINVYGANSIAIGTNSATGGI 

GGAIALGENAFATGGKMLALGSGASATTMSVMGSGSTTTANLTAAGYNPGSGTIAGTSQATNGEVSVGNAGMR 

DAVNVSQLQSEDAKVNTIMNVNNLSGSVTNISSTVNNI^ 

GTNAIALGGAVAGGSYAFAAGSLALAATTGDIALGSSATASSMSNAYATALGTNALMATDATAIGEGASATA^ 
NLSTAGYNPGTGTLSGTTPTGEVSVGSAGKERRVTNVAAGSAATDAVOT 

NITNGGGIKYFHVNSTLADSSAGGTNSIAIGGGATTGNVTAGTSDNISIGTNATTNYGKNIAIGGNAQALGGAYDGGYNTAIGENAIA 
KGDGAGGFGGGGWGQTTAIGGGSQALHDNTTAVGSGAIANVANATALGMSASATAGSAIALGQGAVASAANSVALGSGSTTTXNLSAA 
GYNPGTGTLSGIASVMGEVSVGMGKERRIT1WAAGSAATDAVNVSQ 

FHTKSTMDSSATGTDAVMGGMQATAANSVALGSNSTTTMLSMGYNPGTGALSGIASAMGEVSVGAAGKERRITNVAAG 
DAV1WSQLQSEDAKVNTISITOVMLSG 

SSNTGAIAIGQNASINVYGMSIAIGTNSATGGIGGAIALGENAFATGGK^ 
TIAGTSQATNGEVSVGMGAERRITbTVMGSMTDAVWS 

FHTKSTLADSSATGTDAVAIGGNAQATAANSVALGSNSTTTANLSAAGYNPGTGTLSGTTPTGEVSVGSAGKERRVTNVAAGSAATDA 
VNVSQLQSAIIGSTANAVAYDDGTKATVTLKGASGTKITNLTAGNLSATSTDAVNGSQLYATNQNVSNIGNTVNNITNGGGIKYFHAN 
STQADSSATGSNSVAVGDRASSLGGSSVAMGDGATAVGAASIAIGNNAQNVTGSNNSVAIGGDSKAGDRSVSLGNGADTSLSSWGVAV 
GTNANVSAALGTAIGAGANVSGMSTAIGANAVASATNSVALGSNSTTTMLSMGYNPGTGTLSGIASAANGEVSVGAAGKERRVTN 
VAAGSAATDAVNVSQLQSEDAKVNTINNW^ 

DNMAIGTNATASYGKNIAIGGGAQATGGTYDGGYNVALGENMATAGTNAWGHNTAIGANTVINGVNSVALGISATTSGSGSMAFGSA 

AQASADYAIASGAGMASAWSVALGSNSTTTMLSAAGYNPGTGTLSGIASVMGEVSVGSAGK^ 

SEDMVNTINNNVNNLSNWSNIAGNVTNIS 

NLSAAGYNPGTGTLSGTTPVGEVSVGSAGKERRVTNVAAGSAATDAVNVSQLQSAIIGSTANAVAYDDGTKATVTLKGASGTKITNLT 
AGNLSATSTDAVNGSQLYATNQNVSNVGNTVS^ 

ATAMSVALGSNSTTTMLSAAGYNPGTGALSATTPVGEVSVGSAGKERRVTNVAAGSAATDAVNVSQLMSEDAKVNTINNNVNNLS 

NVSNIAGOTTNISNTVNNITNGGSGIKYFHMSTLADSSATGVDAVAIGGNAQATAANSVALGSNSTTTANLSAAGYNPGTG^ 

SAMGEVSVGMGKERRITNVMGSAATDAVNVSQLQSEDMVNTINNNVNNLSNNVSNIAGNVTNISNTVNNITNGGSGIKYFHANS 

TIiADSSATGTDAVAIGGNASASAANSVALGSNSTTTMLSAAGYNPGSMLSGTASAMGEVSVGAAGKERRITNVMGSAATDAVNV 

SQLQSEDAKVNAEGAATAAALGGGSTYNTTTGAITSPTYIAGGKTFNNVGDWTNIDGRVTQNSTDITNLTTTIDNGTIGLVQQATPT 

STITVMDTGGATVDFRGTGNATRTLTGITAGELSATSTDAVNGSQLYATNQNVSNIDNTVSNLSNNVTNIAGNVTNISNTVNNITNG 

GGGIKYFHANSTLADSSATGVDAVAIGGNAQATAANSVALGSNSTTTANLSAAGYNPGTGTLSGIASAANGEVSVGAAGKERRVTNVA 

AGSAATDAVNVSQLQSEDAKVNTINNNVNNLSNNVSNIAGNVTNISNTVNNITNGGGGIKYFHANSTLADSSATGTNSLAAGPAAVAS 

AT DAVALGNGAKATN AGAVALGAGST TTTAVAT SGTT IGG I T YT FAGVAP S ST VS VGAAG S ERT I TNVAAGRLS AT ST DAVNGS ELFA 

TNQQVTRNTADITNLTNMNIGSVGLVQQDATTRTITVAKATDGTRVDFTGTGGARQLTGVAAGAVNATSVDAVNGSQLYGVSQSVA 

AIGGGSTVNTDGSISAPTYWDGTTVHNAGDAISNLDNRVTQNTTDISTINNTLNSITTGAGVKYVHWSTLADSLAKGAESVA 

AQSQMNSVALGSNSVADRMTVSVGMGAERQITWAAGTADTDAVNVAQLKASGVINTDGTTNAAVTYDHMDGSMYNSV 

VAGGTTIHNVAAGSAADDAVNVSQMNAAISSVSNIIGSAGNPLFT^ 

NTVS VGSAGNERQVTNVAAGTATT DAVNVGQLNQAIGAS I GNLP AGMS AKDYT DQQINAVQNGVNQVAKNAYAGIAAATALTMI PDVD 
QGKT I AVGVGGGS YKGSQAVALG I S ARI TQNLKMKAGAGTS SQGTTVGLGAS YQW 

SEQ ID NO: 18 (EPEQ 

MLIQQNSEVINQLAGNTSETYIEENGASINYVRTNDTGLTFTDASAAGIGSTAVGYNTVAKGDNSVAMGYNSFAEGHSSVAIGQGSYS 

GVETSIALGSESVSSRVIVKGSRNTSVSEEGWIGYDTTDGELLGALSIGDDGKYRQIIOTADGSEMDAVTVRQLQNAI^ 

KYYHMSTAEDSLAVGEDSLMGMTIVNGNAGIGIGLOT 

QNSVGEFSVGSEDGQRQITIWAAGSADTDAVNVGQL^^ 
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ANAQGKDSVMGSGS IMADNSVM^ 
ITLGC^NGGTTRISWSAGWNNDAWYAQ 
IGGGTYNGESAVALGVSMVSANGRWVYKLQGSTNSQGEYSMLGAGIQW 

SEQ ID NO: 19 

GSGGGG 

GTACCGCACGAGCTGGCMA7\MGGCACGGCAGTCTCTTTTGTCGAAGCCCATGATTACAAGTTGCTAGGTAAMTCAAACGTTATAC 

TGAGGAAATTTTAAAGGCACGCATTTTAGCAGGTTTAGAACCTCGCACTAAGCCACCAAAAGATGGCGAAGTGAAATCTGTCAGCAAA 

AMCAAMGGCGCGCATTAMGAAAMCGTGMGATAAGAAAAAMCAGAGGCMAGAAAAMGTAAAATTGCGTCATMGGATACM 

AAAATATCGGCAAACGACGCAAGCCAAGTAATAGTAATATTTA ATTAGGTATGATGTAAATTCT 

MTTTTTCTATATTGCTTTAACGTTTTTTTATAGTAGMGTA^ 

CTAGTTCACTTTAATAACCTTAAATAATTG AGGAT TTCTTATGAAAAGAAATTTATTAAAACAATCTGTAATCGCTGTGTTGATAGGT 
GGCACTACTGTTTCTAATTATGCTTTAGCACAAGCACAAGCACAAGCACAAGTCAAAAAAGATGAACTTAGTGAGTTAAAGAZ\ACAAG 
TAAAGGAAATGGATGCTGCTATCGATGGTATTCTTGATGATAATATTGCTTATGAAGCTGAAGTTGATGCAAAACTTGATCAGCATTC 
TGCTGCTCTTGGTAGACATACAAATAGACTCAATAATCTTAAAACGATTGCAGAGAAAGCAAAAGGTGATTCAAGTGAAGCACTTGAT 
AAAAT TG AAGCT CTTGAAGAACAAAAT G ATGAGTTTTT AGCGGAT AT T ACAGCTT TAGAAGAGGGAGT TGATGGT TT AGATGATGATA 
TCGCAGGT AT TCAAGAT AATATT TCTGAT ATAGAAGATGAT AT T AAT CAAAATTCTGC AGACAT CGCAACTAAC ACAGCGGCAATCGC 
AACTCACACTCAACGTCTTGATAATTTAGATAACAGAGTAAATAACCTTAATAAAGATCTTAAACGTGGTCTTGCTGCTCAAGCTGCA 
TTAAATGGTTTATTCCAACCGTATAACGTAGGTAAATTAAATCTTACTGCTGCTGTAGGTGGTTATAAATCTCAAACTGCAGTTGCTG 
TAGGTAC 

SEQ ID NO: 21 (Haemophilus somnus) 

ATGAAAAAAGTACAATTTTTTAAATATTCATCATTGGCATTAGCATTGGGTTTAGGGGTAAGTGCTTCTGCTTTGGCAGCCCCAACAA 

GTACAAGTACGACTACTGGACCAGAGGCGCCTCCTACAGGCCCTGCTCCTACGGCGAAAGACCCTCTAGCAGAAACAGCGTTAGCCTA 

TGATTTGGAGAACGAAGTTGCGTATCTTCGTATGAAGGCGGGTGAGTGGATGCAATTGGGGCTTGATCCTGAAAAAGAAGTCATCAAA 

GGCTGGAATGAGGTAAAATCTCTCCCTCGTATCGATGGAAATGGAAAGGATAAACAGACAAAAGATCAAATAGCAATGTTGATAAGAA 

CGGTTGATAATACAAAAGAGCTTGGTCGGATCGTTAGTACAAACATTGAAGATATTAAGAACCTTAAAAAAGAGCTTTACGGTTTTGT 

AG AAGATGTGAACGAG AGTGAAG CACGC AAT AT CT CAAGAAT AGATGAGAATGAG AAAG AT ATT AAGAACCTT AAAAAAGAGCT TTAC 

GATTTTGTAGAAGATGTGAACGAGAGTGAAGCACGCAATATCTCAAGAATAGATGAAAATGAGAAGGACATTAATACTCTTAAAGAGC 

TAATGGATGAGGATTTAAATTCAGTCTTAACCCAAATTGAAGATGTAAAACTCACATTTCAAGATGTCAATGATAACGTTAATTTGGC 

ATTTGAAGAGATTAATGGAAATGCCCAAAAGTTTGACACTGCTATTGAAGGACTTACTTCAGGTTTGAGCGATTTACAAGCTAAAGTC 

GATGCAAATAAACAAGAMCTGAAGACGATATTGCGGACAATGCCAAGGCTATTCATAGCAACACAAAAGGTATTGCTTVAAAATACC^ 

AGGATATTCGTGACTTGGACACCAAAACCAAGCAAATGTTGGAAAATGACAAAAACTTGATGACCGGTTTAGAATCTTTAGCAACAGA 

AACAAGCAAAGGCT T TG AAAGATT TGAT GTCAAAACACAACAATT AGATCAAGCCGT CGC AAAT GT CGTCGGTCGAGTAGACAT AACT 

GAGCAAGCTATTCGCCAAAACACTGCAGGCTTAGTCAATGTGAATAAACGTGTCGATACACTCGACAAAAACACCAAAGCCGGTATCG 

CTTCTGCAGTCGCTTTAGGTATGTTGCCACAATCCACTGCTCCGGGTAAATCATTAGTGAGCTTAGGTGTCGGTCATCACCGTGGGCA 

AAGTGCTACTGCTATTGGAGTATCTTCTATGAGCAGTAACGGTAAATGGGTTGTTAAAGGCGGTATGAGCTATGATACACAGCGTCAT 

GCTACTTTCGGCGGTTCTGTCGGTTTTTTCTTTAACTAA 

SEQ ID NO: 22 (Escherichia coM) 

ATGAAAACTGTAAACGTAGCTTTACTGGCACTCATAATTTCAGCAACATCCAGCCCTGTTGTTTTAGCTGGTGATACCATTGAAGCGG 
CGGCAACAGAGCTTTCAGCCATTAACTCTGGCATGTCGCAATCGGAGATTGAGCAGAAGATTACCCGCTTTTTAGAACGCACAGACAA 
CAGCCCCGCTGCGTATACCTATTTGACTGAACATCACTACATCCCTTCTGAAACACCTGATACCACTCAGACTCCCACTGTCCAGACA 
GATCCTGACGCAGGACAAAAAACCGTTGCCGCTACAGGTGATGTACAGACAACTGCCCGTTATCAGAGCATGATCAACGCCCGACAGT 
CTGCGGTAACTGACGCCCAGCAAACGCZy\ATTACAGAGCAACAGGCGCAGATCGTAGCCACACAAAAAACGCTCGCCGCGACTGGAGA 
TACGCAAAATACCGCGCATTATCAGGAAATGATTAATGCCAGACTGGCGGCTCAAAATGAGGCTAATCAGCGCACCGCCACTGAACAA 
GGGCAGAAAATGAATGCGCTGACAACCGATGTGGCAGTACAACAGCAAAATGAAAGGACTCAATACGATAAACAAATGCAAAGTCTGG 
CGCAGGAGTCTGCCCAGGCACATGAACAAATTGACAGCCTGTCACMGACGT 

ACGGGTTGCAGATAACAGCCAGCA7\ATTAACACGCTCAATAACCATTTCAGTTCGCTAAAAAACGAAGTTGATGACAATCGTAAAGAA 
GCCAATGCGGGAACTGCATCTGCCATCGCTATCGCCTCACAACCACAGGTTAAAACCGGTGACGTGATGATGGTGTCAGCGGGAGCGG 
GAACCTTCAACGGTGAATCTGCGGTGTCTGTCGGAACATCATTTAATGCCGGAACGCATACGGTACTTAAAGCCGGTATTTCTGCGGA 
TACACAATCTGATTTCGGCGCAGGTGTCGGCGTGGGATATTCGTTCTAA 
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p£ ID NO: 23 (Escherichia coif) 

ATCGCCAMCAGCGTCGGCGTCTGGGCGCAGTMGAGACTTGCTGACGGTAGATTTCTGGCTTTAGTGTGCTGACATCCTCACCTTCA 

MCAGTAACGTTCCGCTGGTTGGGCTGATCAATGAAGCAACTATTTTTAGCAGCGTACTTTTGCCACAACCAGMGGACCGGTMTTA 

ACTTAAATTCGCCAGCACGCAGCGAAAAATTGATGTTATTMGMTCTTCGCATCACCCGCCAGATATCCTACGTTTTGTAGCTGAAG 

CAMGGACTATTTTCCTGCATCGCTGTTCCCTTTTTCTGATTTTTACTAAAAACAGTTTATCCTTCGCAGGAATAAGGGGGAACTCTC 

TTTCAGTAATCAGGT/ V A'.v.C • • ' GO: 7 'jU) ' \!>: : l . • : a.-;.' V*GC \^x : j;a'^ 

TTATCCCGATTCTCAYTTTTGTCGCGCTGGT^ 

TTTTGGTCGCTATACCAAAACGTTACAGCCGGGGCTCAGTCTGGTGGTGCCGTTTATGGATCGCATTGGTCGCAAGATCAATATGATG 

GAGCAAGTGCTCGATATCCCTTCCCAGGAAGTTATCTCGAAAGATAACGCCAACGTTACCATCGACGCAGTCTGTTTTATTCAGGTGA 

TTGACGCGCCACGCGCGGCTTATGAAGTCAGCAATCTGGAGCTGGCGATCATCAACCTGACCATGACTAACATCCGTACCGTGTTGGG 

TTCAATGGAACTTGACGAAATGCTCTCTCAGCGCGACAGCATCAACTCACGCCTGCTGCGTATTGTCGATGAGGCCACCAACCCGTGG 

GGGATTAAAGTCACCCGTATTGAAATTCGCGACGTGCGCCCACCGGCAGAGCTTATCTCTTCAATGAACGCGCAGATGAAAGCGGAAC 

GTACCAMCGCGCTTACATTCTTGAAGCGGMGGGATCCGTCAGGCGGAAATCCTCAAAGCCGAAGGTGAAAAACAGTCGCAAATCCT 

GAAAGCGGAAGGCGAACGTCAGTCGGCGTTTTTACAGGCTGAAGCGCGTGAACGTTCCGCTGAAGCAGAAGCCCGCGCCACCAAAATG 

GTGTCTGAAGCCATCGCCTCCGGTGATATTCAGGCGGTGAACTACTTCGTAGCGCAGAAATACACCGAAGCGTTACAGCAGATCGGTT 

CCTCCAGTMCAGCAAAGTAGTGATGATGCCATTAGAGGCCAGCAGCCTGATGGGGTCGATTGCCGGGATTGCCGAGCTGGTGAAAGA 

CAGCGCCAACAAGCGGACTCAGCCATGATGGAGTTAATAGTCGTTCATCCACATATTTTCTGGCTCAGTCTCGGCGGTTTGCTGCTGG 

CAGCCGAGATGCTGGGCGGAAATGGTTATTTGTTGTGGAGTGGCGTGGCAGCAGTGATTACTGGCCTGGTGGTCTGGCTGGTGCCGCT 

GGGTTGGGAGTGGCAAGGGGTGATGTTTGCCGTCCTGACGCTGCTCGCCGCCTGGCTGTGGTGGAAATGGTTGTCGCGGCGGGTGCGC 

GAACAAAAGCACAGCGACAGTCATTTAAACCAGCGCGGGCAGCAGCTGATTGGCCGACGTTTTGTGCTGGAATCTCCGCTGGTCAACG 

GGCGCGGTCATATGCGCGTCGGTGACAGTTCATGGCCTGTCAGCGCCAGCGAGGATCTCGGCGCAGGTACGCATGTTGAAGTCATTGC 

GATAGAAGGGATAACGCTGATCATCCGTGCGGTCATCGCCTGATGCGACG CTGACGCG TCTTATCATGCCCGGAAGTCTGCGCCCGAA 

TCGTAGGCCGGATAAGGCGTTTACGCCGCATCCGGC^G 

CGTAGGCCGCCTCCGCCATGTTAAATGTTAACTGGCATT 

GGATAGTGTTTTTTTAGATATTCCAGGACGTTTTTATTGAC 

GGTAAACTTCTGGCGGAATGGTGAAATC^GAAAGCGTTAACCA^ 

CATAATCAGCGACATGGCAGAGGCAGGAGCCGTACTATCGCCGCTTAAATACTTCCACAC 

GGTAGCAATGCCCGGTCCAGATTCATTTC&TTAAAAATCT 

AAATTGAGATTTATCAAAGAAACGCATTTTAGCAC^^ 

AAAACAAGTCTTTACTCTTTTTTACATTGAAAGAGC^ 

GATGGTGTTTACGCTTACAACAGACAAAAATGCGCTTTACATCACACAAA 

TTTCTTAAAACAATATTATTTGTTTCTT^^ 

GCTTTACTGGCACTCATAATTTCAGCAACATCCAGCCCTGM 

CCATTAACTCTGGCMGTCGCAATCGGAGATTG^ 

CTATTTGACTGAACATCACTACATCCCTTCTGAAAC&C^ 

AAAACCGTTGCCGCTACAGGTGATGTAC^GACAACTGCCC 

AGCAAACGC&AATTACAGAGCAACAGGCGCAGATCGTA 

TTATCAGGAAATGATTAATGCCAGACTGGCGGCTCAAAATG 

CTGACAACCGATGTGG(^GTACAACAGCAAAATGAAAGG^ 

CACATGAACAAAOTGAGAGCCTGTCAC^yiGACGTA^ 

CCAGCAAATTAACACGCTCAATAACCATTTCAGTTCGCT^ 

TCTGCCATCGCTATCGCCTCACAACCACAGGTTAAAACCGGTGACGTGATGATGGTGTCAGCGGGAGCGGGAACCTT^ 

CTGCGGTGTCTGTCGGAACATCATTTAATGCCGGAACGC^^ 

CGCAGGTGTCGGCGTGGGATATTCGTTCTAATATTTC^ 

TAATTTGTACATTATTTAAGGTTAACAATGATGAATAGCTCCATTAAATCGTTTTCCCTGCTGGCGGTTATAOTACTGGCTGGCTGTA 

GTTCACCCACTTCCCGCATCGCAGATTGCCAGGCGCAGGGC^ 

TATTTTAAGTGGATCCGAGGCACAGGCATTTAAAAATGCAGAAGCCGCACAACAC 

TTTGGCATGACCTTTAGAATGAGCAGTAAAAACTTTGCTTATCT 

CTGTTTATCAGTCAGGTCTATATAACGTCATTGTTTA^ 

TTTAAAATGAAGGAGCAAAGGAAAATACCCC^^ 

TAGCCTCCCTGGTTCACGCTGTGAATGTTAACAACGAAATCCAGGAAGGCTTATTTCAGTCGGGGCGCAT 
GCATATTCTOTCGGTGCAAACGGGGAT^ 

GTCGGCGCTGTCATCACCGGGGCAGGCATTCGCCAGCGCCAGTAGCTGGTTGCGCATAGATTGCAGTTCTTCGATATGCCGTTCAATC 
TCCGCCACCTTCTCCAGCGTGCGACGTTTGACGTCGGCACTGTGACGCTGCGGGTCGTTAAACAGATTCACCAGCTCGCCGCTCTCTT 
CCAGGTTAAAGCCCACCTGGCGCGCCTGGCGCAGTAAGGTCAATTCGTTGAGATGCTGCTGCGTGTAGGTTCGATAACCATTTTCGCT 
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^kcatcggcggcgtcaccagccccttct^ 
^Katcgttcgcgcaacgcc 

SEQ ID NO: 24 (EPEC) 

ATGMMCTGTMACGTAGCTTTACTGGCACTCATMTTTCAGCAACATCCAGCCCTTTTGTTTTAGCTGGTGATACCATTGZ\AGCGG 
CGGCA^CAGAGCTTTCAGCCATTMCTCGGGCATGTCGCMTCGGAGATTGAGCAGAAGATTACCCGCTTTTTAGAACGCACCGACAA 

v» . . L'»j.' i ■.. v . .■.»'■«• . . A, . v . - j. ..'.jj.£l\.L.. ^wW.'. ■- v# j. v .x..w« «. • • .* >....-.• , i/>>..a ..^U 

GAICCYGACGCAGGACA/VMU^ACCGTTUCCOC^^^ 

CTACGGTAACTGATGCCCAGCAAACGCAAATTACAGAGCAACAGGCGCAGATCGTAGCCACACAAAAAACGCTCGCCGCGACTGGAGA 
TACGCAAAATACCGCGCATTATCAGGAGATGATTAATGCCAGGCTGGCGGCTCAAAATGAGGCTAATCAGCGCACTACCACGGMCAA 
GGGCAGAAAATGAATGCACTGACAACCGATGTGGCAGCACAACAGCAAAAAGAAAGGGCTCAATACGATAAACAAATGCAAAGTCTGG 
CGCAGAAGTCTGTCCAGGCACATGAGCAAATTGAAAGTCTGAGACAAGATTCCGCACAAACGCAGCAACAGTTAACCAACACCCAAAA 
ACGGGTTGCAGATMCAGCCAGCAMTTAACACGCTCAATMCCATTTCAGTTCGCTAAAAAACGMGTTGAGGACMTCGTAMGAA 
GCCAATGCGGGAACTGCATCTGCCATCGCTATCGCCTCACMCCACAGGTGAAAACCGGTGACTTGATGATGGTCTCAGCGGGAGCGG 
GAACCTTTAACGGTGAATCTGCGGTGTCTGTCGGAACATCTTTTAATGCCGGAACGCATACGGTACTTAAAGCAGGTATTTCTGCGGA 
TACACAATCTGATTTCGGTGCGGGTGTCGGCGTGGGATATTCGTTCTAA 

SEQ ID NO: 25 (EAEC) 

ATGAAAACTGTAAAGCTGTCTTTACTGGCTGTCGTTGTTGCTACCGCGGTAAGTCCATCTGCGTTTGCGGGTGATACTGTTGAGGCGG 

CAACGACAG AAT T AACGGT AAT CCAGCC AGGAATGT CGCAAT CGGAAATTGAT CAGAAAATTGGT CGATT TTT AGAAAGGACAGGGAA 

TAGTGTAGCCGCACAAAATTATCTGATTGCGCATGATTACCAGACAACGACGCCTCAGGAAAATACAGCTGCTTCTCCCGTACAGCCC 

ACCAATACGCTGAACCCGATAACCAATCAAGCGCAGACCGACCGCGACAACGGGCAGGATACCGCCATTCAGGACGCGCAGCACGCCG 

CCAACTGGGCTTCACTGAMGCTGATGACGCGCAGCACGCCATCACGGTGGCGCAGACGGATATTGATGCCAATACAGCCGCCATCAC 

CGATACCCGTAATGATGTCTCCGCAGTGCAGTCAGACGTCACCAACATAAAAGGCGATGTCGCACATGCCCAGTCAACGGCTGACCAT 

GCCAACGCTAACGCCAACACCGCTCTGATTAACGGCGTCAAACTTTCCGGTGCTGTGACAGAAAACAAAAACAACATCGAACAGAACC 

GCAGCGATATTGCTGACCAGCAGAAACTGTTGGCATCAAACGAGCAAAAACAGATCGTCCGCGACAACGGGCAGGATACCGCCATTCA 

GGACGCACAACATGCCGCCAACTGGGCTTCACTGAAAGCTGATGACGCGCAACACGCCATCACGGTGGCGCAGACGGATATTGATGCC 

AATAAAGCCGCCATCACCGACATCCGTAATGATGTCTCCGCAGTGCAGTCAGACGTCACCAACATAAAAGGCGATGTCGCACATGCCC 

AGTCAACGGCTGACCATGCCAACGCTAACGCCAACACCGCTCTGATGAACGGCGTCAAACTCTCCTCTGCTGTGACAGAAAACAAAAA 

TAATATCGAACAGAACCGCAGCGATATTGCTGACCAGCAGAAACTGTTGGCATCAAACGAGCAAAAACAGATCGTCCGCGACAACGGG 

CAGGATACCGCCATTCAGGACGCACAACATGCCGCCAACTGGGCTTCAATGAAAGCTGATGACGCGCAGCACGCCATCACGGTGGCGC 

AGACGGATATTGATGCCAATAAAGCCGCCATCGCCGACACCCGTAATGATGTCTCCGCAGTGCAGTCAGACGTCACCAACATAAAAGG 

CGATGTCGCACATGCCCAGTCAACGGCTGACCATGCCAACGCTAACGCCAACACCGCTCTGATTAACGGCGTCAAACTTTCCGGTGCT 

GTGACAGAAAACZYAAAATAATATCGAACAGAACCGCAGCGATATTGCTGACCAACAGCAACAACTCGACGAAACCCGGAAAATCGTTG 

CCGCCACCGGGGATGTGCAGACTGCCGCCCGATATCAGAGCATGATTGACGCCAGACAGACAGCCGCTGCCAATGCGCAACAGGCACA 

AGCTGACACCCAGCAGCAACAAATGGACGATCAGCAGAAACAAATCGACGCGACGCAAAAAACGGTTTCCGCACTTGGCGATGCCCAG 

ACCAACGCACATTATCAAGAGATGGTTAACGCCGGACTGAGAGCACAAAATGATGCGAATGCGCGTACTGCAGCAGAACAAAAACAAA 

AAATAGATACTCTGGCGACTAACCAGGCAACGCAACAGCATATCAATAGTGTGCAGTACGGGGAACAAATTCAGCGTCTGGCGCAAGA 

CTCAACACAAACGCATGAACAAATTGACAGCCTGACACAAGACGTAACCCAAACGCATCAGCAGTTAAGCAACACGCAAAAACGAGTA 

GCGGATAATAGCCAGCAGATTACTACGCTCAATAACCATTTCAGTTCGCTGAAAAACGAAGTTGAGGACAACCGTAAAGAAGCCAATG 

CGGGAACTGCATCAGCCATCGCTATCGCCTCACAACCACAGGTGAAAGCCGGTGACTTTATGATGATGTCAGCGGGAGCGGGAACCTT 

CMCGGTGAATCTGCGGTGTCTGTCGGAACATCTTTTAATGCCGGAACGCATACCGTGATTAAAGCCGGTGTCTCTGCGGATACGCAA 

TCTGATTTCGGCGCGGGTGTCGGCGTGGGATATTCTTTCTAA 

SEQ ID NO: 26 (UPEC) 

ATGAACAAAATATTTAAAGTTATCTGGAATCCGGCAACAGGCAGTTACACCGTTGCCAGCGAAACGGCGAAGAGCCGTGGTAAAAAAA 
GCGGGCGCAGTAAGCTGTTAATTTCTGCACTGGTTGCGGGTGGGTTGTTGTCGTCGTTTGGGGCAAGTGCAGATAATTACACTGGGCA 
GCCAACTGATTATGGCGATGGCTCAGCAGGTGACGGCTGGGTTGCTATCGGTAAAGGGGCAAAAGCAAATACCTTTATGAACACTAGT 
GGCGCGAGTACAGCTTTAGGATATGACGCGATAGCCGAAGGTGAGTACAGTTCTGCCATCGGGTCAAAAACCCTTGCAACTGGTGGAG 

CATCCATGGCGTTCGGGGTTAGTGCAAAAGCMTGGGTGACAG^ 

GATGGCTTTTGGTCGTTACGCAAAGACGAATGGTTTTACATCTCTTGCTATTGGGGACTCCTCCCTTGCCGATGGTGAAAAZVACTATT 
GCGTTAGGAAATACGGCTAAAGCTTACGAAATTATGAGCATCGCCCTCGGTGATAATGCCAATGCGTCAAAAGAGTATGCAATGGCGC 
TGGGAGCAAGTAGCAAAGCTGGCGGTGCTGATAGCCTCGCATTCGGCAGAAAATCTACAGCTAATAGCACTGGCTCACTGGCAATAGG 
TGCTGACAGTAGCAGTTCGAACGATAACGCCATCGCGATAGGGAACAAAACGCAAGCCCTGGGAGTGAATTCGATGGCCCTGGGTAAT 
GCMGTCAGGCATCTGG£GMTCCAGTATTGCATT 

GCATTGCAAGCAAAGTGAACTCAATCGCGTTGGGAAGTAACAGTTTGTCCTCGGGAGAGAATGCCATCGCATTGGGAGAGGGTAGTGC 
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KTGGTGGCAGCMCAGCCTTGCTTTCGGTAGCCAGTCCAGGGCAAACGGCMTGATTCTGTCGCCATCGGTGTAGGGGCTGCAGCA 
ACCGACAATTCTGTCGCTATCGGCGCAGGATCGACCACAGATGCAAGCAATACGGTTTCAGTTGGCAACAGCGCAACAAAACGCA 
MATTGTTAATATGGCTGCTGGTGCCATMGCAACACCAGTACCGATGCCATCMCGGCTCACAGCTTTATACGATCAGTGATTCAGT 
CGCCMGCGACTCGGAGGAGGCGCTACTGTAGGCAGCGATGGCACCGTAACCGCAGTAAGCTACGCGTTGAGAAGCGGMCCTATAAT 
AACGTGGGTGATGCTCTGTCAGGMTCG?VCMTAATACCCTACAATGGAATAAMCCGCGGGGGCGTTCAGCGCCMTCACGGTGCA?V. 
ri ' ' ■ ' ^^'''Tr/'^mGGTACGGTTTC^ r ; : 

- 1 '/ ':C-:;C;kCAGCATTCAGTGCCG^ a:/- ; \> 

GGCAACCTGACTGCCGGCAGCACTGACGCCGTTAACGGCTCTCAGCTCAAAACCACCAACGACAACGTGACGACCAACACCACCAACA 
TCGCCACTAACACCACCAATATCACCAACCTGACTGACGCTGTTAACGGTCTCGGTGACGACTCCCTGCTGTGGAACAMGCAGCTGG 
CGCATTCAGCGCCGCGCACGGCACCGAAGCCACCAGCAAAATCACCAACGTCACCGCTGGCAACCTGACTGCCGGTAGCACTGACGCC 
GTTAACGGCTCCCAGCTCAAAACCACCAACGACAACGTGACGACCAACACCACCAACATCGCCACTAACACCACCAATATCACCAACC 
TGACTGACGCTGTTMCGGTCTCGGTGACGACTCCCTGCTGTGGAACAAMCAGCTGGCGCATTCAGCGCCGCGCACGGCACTGACG^ 
CACCAGCMGATCACCMCGTCACCGCTGGCAACCTGACTGCCGGC^GCACTGACGCCGTTAACGGCTCCCAGCTCAAAACCACCAAC 
GACAACGTGACGACCAACACCACCAACATCGCCACTAACACCACCAATATCACCAACCTGACTGACGCTGTTAACGGTCTCGGTGACG 
ACTCCCTGCTGTGGAACAAAACAGCTGGCGCATTCAGCGCCGCGCACGGCACTGACGCCACCAGCAAGATCACCAATGTCAAAGCCGG 
TGACCTGACAGCTGGCAGCACTGACGCCGTTAACGGCTCTCAGCTCAAAACCACCAACGATAACGTGTCGACCAACACCACCAACATC 
ACCAACCTGACTGACGCTGTTAACGGTCTCGGTGACGACTCCCTGCTGTGGAACAAAACAGCTGGCGCATTCAGCGCCGCTCACGGCA 
CTGACGCCACCAGCAAGATCACCAATGTCAAAGCCGGTGACCTGACAGCTGGCAGCACTGACGCCGTTAACGGCTCCCAGCTCAAAAC 
CACCAACGATAACGTGTCGACCAACACCACCAACATCACTAACCTGACGGATTCCGTTGGCGACCTTAAGGACGATTCTCTGCTGTGG 
AACAAAGCGGCTGGCGCATTCAGCGCCGCGCACGGTACCGAAGCTACCAGCAAGATCACCAACTTACTGGCTGGCAAGATATCTTCTA 
ACAGCACTGATGCCATTAATGGCTCACAACTTTATGGCGTAGCGGATTCATTTACGTCATATCTTGGTGGTGGTGCTGATATCAGCGA 
TACGGGTGTATTAAGTGGGCCAACCTACACTATTGGTGGTACTGACTACACTAACGTCGGTGATGCTCTGGCAGCCATTAACACATCA 
TTTAGCACATCACTCGGCGACGCCCTACTTTGGGATGCAACCGCAGGCAAATTCAGCGCCAMCACGGCATTAATAATGCTCCCAGTG 
TAATCACTGATGTTGCAAACGGTGCAGTCTCGTCCACCAGCAGCGACGCCATTAACGGTTCACAACTTTATGGTGTTAGTGACTACAT 
TGCCGATGCTCTGGGCGGGAATGCTGTGGTGAACACTGACGGCAGTATCACTACACCAACTTATGCCATCGCTGGCGGCAGTTACAAC 
AACGTCGGTGACGCGCTGGAAGCGATCGATACCACGCTGGATGATGCTCTGCTGTGGGATACAACAGCCAATGGCGGTAACGGTGCAT 
TTAGCGCCGCTCACGGGAAAGATAAAACTGCCAGTGTAATCACTAACGTCGCTAACGGTGCAGTCTCTGCCACCAGCAACGATGCCAT 
TAATGGCTCACAGCTCTATAGCACTAATAAGTACATCGCTGATGCGCTGGGTGGTGATGCAGAAGTCAACGCTGACGGTACTATCACT 
GCACCGACTTACACCATTGCAAATACCGATTACAACAACGTCGGTGAAGCCCTGGATGCGCTCGATAATAACGCGCTGCTGTGGGATG 
AAGACGCAGGTGCCTACAACGCCAGCCATGATGGCAATGCCAGCAAAATCACCAACGTTGCGGCTGGTGATCTCTCCACAACCAGTAC 
CGATGCTGT TAACGGTT CCCAGTTAAACGCAACCAATATTCTGGT TACGCAAAATAGCCAAATGATTAACCAGCTTGCTGGTAACACT 
AGCGAAACCTACATCGAGGAAAACGGTGCGGGTATTAACTATGTACGTACCAACGACAGCGGCTTAGCGTTCAACGATGCCAGCGCTT 
CAGGTATTGGCGCTACAGCTGTAGGTTATAACGCAGTTGCCTCTCATGCCAGCAGTGTAGCCATCGGTCAGGACAGCATCAGCGAAGT 
TGATACGGGTATCGCTCTGGGTAGCAGTTCCGTTTCCAGCCGTGTAATAGTTAAAGGGACTCGTAACACCAGCGTATCGGAAGAAGGT 
GTTGTGATTGGTTATGACACCACGGATGGCGAACTGCTTGGCGCGTTGTCGATTGGTGATGACGGTAAATATCGTCAAATCATCAACG 
TCGCGGATGGTTCTGAAGCCCATGATGCGGTCACTGTTCGCCAGTTGCAAAACGCCATTGGTGCAGTCGCAACCACACCAACCAAATA 
CTATCACGCCAACTCAACGGCTGAAGACTCACTGGCAGTCGGTGAAGACTCGCTGGCAATGGGCGCGAAAACCATCGTTAATGGTAAT 
GCGGGTATTGGTATCGGCCTGAACACGCTGGTTCTGGCTGATGCGATCAACGGTATTGCTATCGGTTCTAACGCACGCGCAAATCATG 
CCGACAGCATTGCAATGGGTAATGGTTCTCAGACTACCCGTGGTGCGCAGACCAACTACACTGCCTACAACATGGATGCACCGCAGAA 
CTCTGTGGGTGAGTTCTCTGTCGGCAGTGAAGACGGTCAACGTCAGATCACCMCGTCGCAGCAGGTTCGGCGGATACCGATGCGGTT 
AACG T G GGT C AG i 'I G AAAG T AACG G ACGCGC AGGTTTCC CAG AAT ACCC AG AG CAT '± ACT AACCT G AAC AC TCi VGG'i 1 CACI'AA'i' C'x' C G 
ATACTCGCGTGACCAATATCGAAAACGGCATTGGCGATATCGTAACCACCGGTAGCACTAAGTACTTCAAGACCAACACCGATGGCGC 
AGATGCCAACGCGCAGGGTAAAGACAGTGTTGCGATTGGTTCTGGTTCCATTGCTGCCGCTGACAACAGCGTCGCACTGGGCACGGGT 
TCCGTAGCAGACGAAGAAAACACCATCTCTGTGGGTTCTTCTACCAACCAGCGTCGTATCACCAACGTTGCTGCCGGTGTTAATGCCA 
CCGATGCGGTTAACGTTTCGCAACTGAAGTCTTCTGAAGCAGGCGGCGTTCGCTACGACACCAAAGCTGATGGCTCTATCGACTACAG 
CAACATCACTCTCGGTGGCGGCAATAGCGGTACGACTCGCATCAGCAACGTTTCTGCTGGCGTGAACAACAACGACGCAGTGAACTAT 
GCGCAGTTGAAGCAAAGTGTGCAGGAAACGAAGCAATACACCGATCAGCGCATGGTTGAGATGGATAACAAACTGTCCAAAACTGAAA 
GCAAGCTGAGTGGTGGTATCGCTTCTGCAATGGCAATGACCGGTCTGCCGCAGGCTTACACGCCGGGTGCCAGCATGGCCTCTATTGG 
TGGCGGTACTTACAACGGTGAATCGGCTGTTGCTTTAGGTGTGTCGATGGTGAGCGCCAATGGTCGTTGGGTCTACAAATTACAAGGT 
AGTACCAATAGCCAGGGTGAATACTCCGCCGCACTCGGTGCCGGTATTCAGTGGTAA 

SEQ ID NO: 27 (EHEC) 

ATGAACAAMTATTTAAAGTTATCTGGMCCCTGCGACAGGGAATTATACTGTTACCAGCGAMCGGCAAAMGCCGTGGCAAGAAAT 
CTGGGCGCAGTAAGCTGTTAATTTCTGCGCTGGTTGCGGGTGGAATGTTGTCGTCGTTTGGGGCATTGGCGAATGCCGGGAATGACAA 
CGGTCAGGGTGTTGATTACGGTAGTGGATCAGCTGGCGACGGCTGGGTTGCTATAGGCAAAGGGGCGAMGCAAATAC^ 
ACCAGTGGTTCCAGTACTGCTGTGGGTTATGACGCTATAGCTGAAGGCCAATATAGCTCTGCCATCGGGTCAAAAACCCATGCGATTG 
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GTGCATCMTGGCCTTTGGGGTTAGTGCMTATCAGMGGCGATAGAAGTATAGCACTGGGTGCCTCTTCGTATTCATTGGGCCA 
CTCAATGGCCCTCGGCCGTTATTCAAAAGCATTGGGTAAATTGTCTATTGCTATGGGGGACTCTTCCAAAGCGGAAGGAGCAAAC 



GCCATTGCCCTGGGAAATGCCACTAAAGCTACTGAGATTATGAGTATTGCTCTTGGCGACACCGCCAATGCGTCAAMGCGTATTCAA 
TGGCGCTGGGAGCMGTAGCGTCGCATCTGAAGAAMCGCTATTGCGATAGGTGCTGAGACCGMGCCGCTGAAMTGCMCTGCTAT 
TGGCAATAATGCGAAGGCAAAAGGGACTAATAGCATGGCAATGGGGTTCGGAAGCCTTGCCGATAAAGTCAATACTA.TCGCATTAGGA 

7» '!\ 1i rc,r ? r.r.C]\ GGCTO r «?GGCAG ATAATf ]m }A 5,1 r:r-:c;C n T?\nGnr;/ VGfGC ?yACMAGCTGATGGCGTGGATGCC 7 V.C C- , r : T'GGTA'A.TG 

- • •.. •< •': .i.'. • . ;«c ; ggt gataagagt c± . ' : 

CAGTGCCAACGGTATT AACT CTGTCGCGCTGGGCGCAGATTCCATTGCGGATT TAGACAATACCGTCTCTGTCGGu/ia: v 1 1 kvj 1 1 UAi i A 

AAACGCAAGATCGTTAATGTGAAAAATGGCGCGATCAAGTCTGACAGTTACGATGCCATTAATGGTTCACAGCTTTATGCCATTAGCG 

ACTCGGTAGCMAAAGGCTTGGAGGAGGGGCTGCAGTAGATGTTGATGACGGTACTGTTACAGCACCAACCTACAATTTAAAAAATGG 

TAGCAAAAATAACGTAGGGGCTGCGCTCGCTGTACTTGATGAAAACACCCTGCAATGGGACCAAACCAAAGGCAAATACAGCGCTGCT 

CATGGTACTAGTAGCCCAACTGCCAGCGTAATCACCGATGTTGCGGATGGCACGATTTCAGCCTCCAGTAAGGATGCGGTTAACGGTT 

CCCAACTGAAAGCTACCAATGACGATGTCGAAGCCAACACCGCCAATATCGCTACTAATACCAGCAACATTGCCACGAATACGGCAZVA 

TATTGCCACCAATACCACCAATATCACCAACCTGACGGATTCCGTTGGTGACCTTCAGGCTGATGCCCTGCTCTGGAACGAAACTAAA 

AAGGCATTCAGTGCAGCTCACGGCCAGGATACCACCAGCAAAATCACCAACGTTAAAGATGCCGACCTGACGGCTGACAGCACTGATG 

CTGTTAACGGCTCTCAGCTGAAAACCACCAACGATGCTGTGGCGACGAATACCACCAATATCGCCAATAACACTTCCAATATTGCCAC 

TAACACCACCAACATCTCTAACCTGACTGAGACGGTGACTAATCTTGGTGAGGATGCGCTGAAATGGGATAAGGACAATGGTGTATTC 

ACGGCAGCTCATGGCACCGAGACCACCAGCAAAATCACCAACGTTAAAGATGGCGACCTGACKACTGGCAGCACCGATGCCGTTAACG 

GCTCTCAGCTGAAAACCACCAACGATGCCGTGGCGACGAATACCACCAATATCGCCACTAACACCACCAACATCTCTAATCTGACTGA 

GACGGTGACTAATCTTGGTGAGGATGCGCTGAAATGGGATAAGGACAATGGTGTCTTCACTGCAGCTCATGGCAACAATACCGCCAGC 

AAAATCACCAATATCCTGGACGGCACAGTCACTGCAACCAGTTCCGATGCCATTAACGGTAGCCAGCTTTATGACTTAAGCAGCAATA 

TCGCCACCTACTTCGGCGGCAATGCTTCTGTGAATACTGACGGTGTGTTTACCGGTCCAACCTACAAAATCGGTGAAACAAATTATTA 

TAACGTCGGCGATGCACTGGCTGCGATTAACTCCTCATTTAGCACGTCTCTCGGCGATGCTCTGCTTTGGGATGCCACCGCAGGTAAA 

TTCAGTGCCAAACACGGTACTAATGGTGACGCAAGCGTGATCACTGATGTCGCAGATGGTGAAATTTCAGACTCCAGTTCTGACGCAG 

TAAACGGCTCACAACTCCACGGCGTGAGCAGTTATGTTGTTGATGCGCTGGGGGGTGGTGCCGAAGTCAATGCAGACGGCACCATCAC 

TGCGCCGACGTACACCATTGCTAATGCTGATTACGATAATGTCGGTGATGCCCTGAATGCTATCGATACCACTCTTGACGACGCTCTG 

CTCTGGGATGCGGACGCCGGTGAAAATGGTGCATTTAGCGCCGCTCACGGAAAAGATAAAACTGCCAGTGTAATCACTAACGTCGCTA 

ACGGTGCAATCTCTGCTGCCAGCAGCGACGCGATTAACGGCTCACAACTCTATACCACCAATAAGTACATCGCTGATGCGCTGGGTGG 

TGACGCAGAAGTCAACGCTGACGGCACCATCACCGCACCGACTTACACCATTGCGAACGCCGAGTACAACAACGTCGGTGACGCCCTG 

GATGCGCTTGATGATAACGCCCTGCTGTGGGATGAGACTGCCAATGGCGGTGCTGGAGCCTACAATGCCAGCCATGACGGTAAAGCCA 

GCATCATCACTAATGTCGCTAATGGCAGTATTAGTGAGGACAGTACCGATGCAGTGAACGGTTCTCAGTTGAATGCGACGAATATGAT 

GATTGAGCAG7\ACACCCAAATTATCAATCAGCTCGCTGGTAACACCGACGCAACCTATATCCAAGAAAACGGTGCGGGTATTAACTAT 

GTGCGTACTAACGACGACGGCTTAGCGTTCAACGACGCCAGCGCACAGGGTGTTGGCGCTACAGCTATAGGTTAT2VACTCTGTCGCCA 

AAGGCGATAGCAGCGTAGCTATTGGTCAGGGCAGCTACAGCGACGTTGATACGGGTATCGCCCTGGGTAGCAGCTCTGTTTCCAGCCG 

AGTGATTGCCAAAGGCTCCCGTGACACCAGCATAACGGAAAATGGCGTTGTTATTGGTTACGACACCACGGATGGCGAACTGCTCGGT 

GCATTGTCTATCGGTGATGACGGTAAATATCGTCAAATCATCAACGTAGCCGATGGTTCCGAAGCCCATGACGCCGTTACGGTTCGTC 

AATTGCAGAATGCGATTGGTGCGGTCGCAACCACGCCGACTAAATACTTCCACGCTAATTCAACGGAAGAAGATTCACTGGCAGTGGG 

AACTGACTCGCTGGCAATGGGTGCGAAAACCATCGTGAATGGCGATAAAGGTATTGGTATCGGTTATGGTGCCTACGTGGACGCGAAT 

GCACTTAACGGCATTGCCATTGGTAGCAATGCGCAAGTCATTCATGTCAACAGTATTGCGATAGGTAATGGTTCTACGACCACTCGTG 

GCGCTCAAACCAATTATACCGCCTACAACATGGACGCACCGCAGAACTCTGTCGGTGAATTCTCAGTCGGTAGTGCGGATGGTCAACG 

AATACCCAGAGCATTACTAACCTGGATAATCGGGTAACGAATCTTGATTCACGCGTCACCAATATCGAAAACGGTATTGGCGATATCG 
TCACCACCGGTAGCACCAAGTACTTCAAGACCAATACCGATGGTGTAGATGCCAGCGCGCAGGGTAAAGATAGCGTCGCGATTGGTTC 
CGGCTCCATTGCTGCCGCTGACAACAGCGTCGCTCTGGGTACAGGGTCTGTGGCAACCGAAGAAAATACGATCTCTGTAGGTTCCTCT 
ACTAACCAACGTCGTATCACCAACGTAGCTGCAGGTAAAAATGCTACCGATGCTGTTAACGTGGCACAGTTGAAGTCTTCCGAAGCTG 
GCGGTGTACGTTACGACACCAAAGCTGATGGTTCTATCGACTATAGCAATATCACCCTCGGTGGCGGCAACGGCGGTACGACTCGTAT 
CAGCAACGTCTCCGCTGGCGTCAACAACAACGACGTGGTGAATTACGCGCAGTTGAAGCAAAGCGTGCAGGAZ^ACGAAGCAATACACC 
GATCAGCGAATGGTTGAGATGGATAACAAACTGTCTAAAACTGAAAGCAAGTTGAGCGGTGGTATCGCTTCTGCAATGGCAATGACCG 
GTCTGCCGCAGGCTTACACTCCAGGTGCCAGCATGGCCTCTATTGGTGGCGGTACTTACAACGGTGAATCGGCAGTTGCTTTAGGTGT 
ATCGATGGTGAGCGCCAATGGTCGTTGGGTCTACAAATTACAAGGTAGTACCAATAGCCAGGGTGAATACTCCGCCGCACTCGGTGCC 
GGTATTCAGTGGTAA 

SEQ ID NO: 28 (EAEC) 

ATGAACAAAATTTTTAAAGTTATCTGGAATCCGGCAACAGGCAGTTACACCGTTGCCAGTGAAACGGCAAAAAGCCGTGGCAAGAAAT 
CCGGGCGCAGTAAGCTGTTAATTTCTGCCCTGGTTGCT 

CGGTCAGGGCGTCAATTATGK3CCAGGGAACAGGAACAGGCTGGGTTGCTATCGGTGAAGATGCAAMGCAAATAGTTTTACAGA 



36 



f 



^fcGGTGGCTCCAGCACCGCAGTAGGTTATCACTCMCTGCTGACGGAAGGTGGTCMCAGCCCTCGGTGCTAAGACCCATTCGCTTG 

^TOAAGCATCAGTGGCGTTAGGGATTAACACTACCTCAGCAGGTGAGCGAAGCCTCGCTATAGGTGCATCGGCAACGTCAACGGGTGG 
ATTTTCAATCGCACTTGGTCGTTATGCGAACTCAGTAGGTGAATTTTCTATTGCACAAGGGGATCACGCAGAGACTGGTGCTGATGAC 
GCCATCGCATTTGGACGTGAAAGCAAGGCCTTAGGAATTATGAGTATCGCCCTCGGAGCCACGGCTAATGCGTCGAAAGMTATGCAA 

ACl7\ACGCGGACCCAAATGGACT VaAi'AG iHi CoCGi''i i GGGCTCCGGCAGTATTGCAGATG , lCGACAaCAuGx-i.i i'Gu AC ibboCiii^ ^ 
AAAGTCAGGCAGTAGCGGCTGGCGCAATTGCCATCGGTCAAGGGAATAAAGCTGACGGCGCAAATGCTATCGCGCTGGGTAATGGTAG 
TATTACAGGTGGTGTAAATGCTATTGCTCTTGGACAAGGCAGTTATGCCGGTTTAGAAAATGGCACTGCAATTGGTGCTCAAGCCAGT 
GCTCAGGGGAAAAATTCAGTTGCTCTGGGTGCTGGTTCTGTAGCGACTGACGCGGATACTGTTTCTGTGGGTAACACAACAGCTCAGC 
GACAAATTGTCAAT ATGGC AGCAGGT GAT ATCAGCACT ACCAGTACTGATGCCATCAAT GGAT CACAGCTT T ATGCTAT CAGT AAGTC 

AGTAGCGGACAATCTTGGTGGGGGGGCTACCGTCfiATGCGCMGGCGTCGTTACTTCCCCAAATTACAGGCTGAAAAGTGGTATTTTC 

GGCACTGTTGGCGACGCCTTAACGGGCCTGGACAATAATACGTTACAATGGGACTCCCTTAAAAAGGCATATAGTGCGGCACATGGTA 

CAGATACTACCAGTACCATCACCAACGTTAAAGACGGCGCTATTTCTGATACCAGTAAGGATGCGGTTAACGGTTCTCAGCTGAAAAC 

CACCAACGATAACGTAGCGACCAATACTGCCAATATCACCACCAACACTAACAGTATCAATACCCTGACGGATTCCGTTGGCGACCTT 

AAAGACGATGCCCTGCTGTGGAATGGCACCGCGTTCAGCGCCGCGCACGGCACCGAAGCCACCAGCAAAATCACCiyVCGTCAAAGACG 

GTGACCTGACGGCTGGCAGCACCGACGCGGTTAACGGCTCTCAGCTCAAAACCACTAACGATAATGTGGCAACCAACACCACCAACAT 

CACCAACCTGACGGATTCCGTTGGCGACCTTAAAGACGATGCCCTGCTGTGGAATGGCACCGCGTTCAGCGCCGCGCACGGTACCGAT 

GCCACCAGCAAAATCACCAACGTCAAAGACGGTGACCTGACGGCTGGTAGCACTGACGCGGTAAACGGCTCTCAGCTGAAAACCACTA 

ACGATGCCGTGGCAGCCAACACCACCAATATCGCCACGAATACCACCAACATCACCAACCTGACTGACGCAGTTGACAGCCTCGGTGA 

CGATTCCCTGCTGTGGAACGCTACAGCGGGGGCATTCAGCGCCGCACACGGTACTGATGCCACCAGCAAAATCACCAACGTCACCGCT 

GGCGACCTGACGGCTGGCAGCACCGACGCGGTTAACGGCTCTCAGCTCAAAACCACTAACGATGCCGTGGCAGCCAACACCACCAATA 

TCGCCACGAACACCACCAACATCACCAACCTGACTGACGCTGTTGACAGCCTCGGTGACGATTCCCTGCTGTGGAACGCTACTGCGGG 

GGCATTCAGCGCCGCGCACGGTACTGATGCCACCAGCAAAATCACTAACGTCAAAGACGGTGACCTGACGGCTGGCAGCACTGACGCG 

GTTAACGGCTCTCAGCTCAAAACCACTAACGATGCCGTGGCAGCCAACACCACCAATATCGCCACGAACACCACCAACATCACCAACC 

TGACTGACGCTGTTGACAGTCTCGGTGACGATTCCCTGCTGTGGAACGCTACGGCGGGTGCATTCAGTGCCAAACACGGCACCAACGG 

TACTGACAGCAAAATCACCAACTTACTGGCAGGCACTGTATCCTCTGATAGCACTGACGCTATTAATGGCTCACAACTTTATGGCTTA 

GCTGATTCATTTACGTCATACCTTGGCGGTGGTGCTGATATCAGCGATGCGGGTGTATTAACCGGGCCAACCTACACTATTGGTGGTA 

CTGATTACAATAACGTCGGTGATGCTCTGGCTGCCATTAACACGTCATTTAGCACATCACTCGGTGACGCCCTACTCTGGGATGCGAC 

CGCCAAGGGCGGTGACGGTGCGTTTAGCGCCGGTCGCGGGACAGATAACACAGCCAGCATCATTACTAATGTTGCAGACGGTGCAATT 

TCCTCTACCAGCAGCGACGCT ATTAACGGCT CACAGCT CTATGACACCAGCAAGTACATTGCCGATACT CTGGGTGGTGACGCAGAAG 

TCAATGCTGACGGCACAATCACCGCACCGACTTATGCCATCGCTGGCGGCAGTTACAGCAACGTCGGTGACGCGCTGGAAGCGATCGA 

TACCACGCTGGATGACGCTCTGTTGTGGGATGCAACAGCCAATGATGGCAATGGTGCATTTAGCGCCGCTCACGGAAAAGATAAAACA 

GCCAGTGTAATCACTAACGTCGCTAACGGTGCAATCTCTGCCACCAGCAGCGATGCCATTAACGGTTCACAACTGTATACCACCAACA 

AGTACATTGCCGATGCCCTGGGTGGTGACGCAGAAGTTAACGCTGATGGTTCTATTACTGCGCCGACTTACACCATTGCAAATGCCGA 

GTACAACAACGTCGGTGACGCCCTGGATGCGCTCGACGATAACGCTCTGCTGTGGGATGCAACAGCCAATGACGGCGCAGGTGCCTAC 

AACGCCAGCCATGACGGCAAGGCCAGCATCATCACAAATGTTGCTGATGGTAACATTGGCGAAGGCAGCACTGACGCCATCAACGGTT 

CTCAGCTGTTTAACACCAATATGCTGATCCAGCAGAACAGCGAAATCATCAATCAGCTTGCCGGTAACACCAGCGAAACCTACATCGA 

AGATAACGGTGCGGGCATTAACTATGTACGTACCAACGACAACGGCTTAGCGTTCAACGATGCCAGCGCTTCAGGTATTGGCGCTACG 

GCTGTGGGTTATAACGCTGTCGCCTCAGGCGAAAGCAGCGTAGCCATTGGTCAAGGTAGCAGCAGCftACGTTGATACGGGTATCCCCC 

TGGGTAGCAGTTCCGTTTCCAGCCGTGTAATAGTTAAAGGTTCTCGTGACACCAGCGTGTCGGAAGAAGGTGTTGTG/VTTGGTTZiTGA 

CACCACGGATGGCGAACTACTTGGTGCGTTGTCTATTGGTGATGACGGTAAATATCGTCAAATCATCAACGTAGCCGATGGTTCCGAA 

GCCCATGACGCCGTTACGGTTCGCCAGTTGCAAAATGCCATTGGTGCAGTCGCTACCACGCCGACCAAATACTTCCACGCCAACTCAA 

CGGAAGAAGATTCACTGGCAGTAGGTGAAGACTCGCTAGCAATGGGCGCGAAAACTATCGTTAATGGTAATGCGGGTATTGGTATCGG 

TTATGGTGCCTACGTGGACGCGAATGCACTTMTGGCATTGCTATCGGTAGCAACGCGCGTGCAAACCATGCAAACAGCATTGCTATG 

GGTAATGGCTCACAGACGACTCGTGGTGCTCAAACTGGCTACGCCGCCTACAACATGGACGCACCGCAGAACTCTGTGGGTGAGTTCT 

CTGTCGGCAGTGAAGACGGTCAACGTCAGATCACCAACGTCGCGGCTGGTTCGGCTGATACCGATGCGGTTAACGTGGGTCAGTTGAA 

AGTAACGGACGCGCAGGTTTCCCAGAATACCCAGAGCATTACTAACCTGMCAATCAGGTCACTAATCTGGATACTCGCGTTACTMT 

ATCGAAAACGGTATTGGCGACATTGTAACCACCGGTAGCACCAAGTACTTCAAGACCAACACCGATGGCGTAGATGCCAACGCGCAGG 

GTAAAGATAGTGTTGCGATTGGTTCTGGTTCCATTGCTGCCGCTGACMCAGCGTCGCGCTGGGTACAGGCTCCGTGGCCAACGAAGA 

AAATACCATCTCTGTGGGTTCTTCTACCMCCAGCGTCGTATCACCAACGTTGCTGCAGGTGTTAATGCCACCGATGCGGTTAACGTT 

TCGCAGCTGAAGTCTTCTGAGGCAGGCGGCGTTCGTTACGACACCAAAGCTGATGGCTCTGTAGAGTACAGCAACATCACTCTCGGTG 

GTGGTAATGGCGGTACGACTCGCATCAGCAACGTCTCCGCTGGCGTGAACAACAACGACGCAGTGAACTATGCGCAGCTGAAGCAAAG 

CGTGCAGGAAACGAAGCAATATACCGATCAGCGGATGGTTGAGATGGATAACAAACTGTCCAAAACCGAAAGCAAGTTGAGCGGTGGT 

ATCGCTTCTGCAATGGCAATGACCGGTCTGCCGCAGGCTTACACTCCAGGTGCCAGCATGGCCTCTATT<5GTGGCGGTACTTACAACG 
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^^ATACTCCGCCGCACTCGGTGCCGGTATTCAGTGGTAA 
SEQID NO: 29 (Bonrkholderia fnumgorwim) 

ATGMCMGAC^TATCGCTCCGTCTCG^ 

.......... ■ > .■ - ■ . . . k . 

TCCGACTATCAATGGGGTTTGTTCAGCAACAC^ 

ATGACGGCTCATTGCTGTTGTACGGTCCCAACAACATTGTGATGAAGAACCTGGTCAGCATGTCCAGCAACAAGATCATCAATCTTGC 

CCCTGGGACAGTGAGCTCTACCAGCGCAGACGrGGTAAACGGCTCGCAGTTGTATGCGACGMCgAGMCGTGTCGMC^TCGGAAAC 

ACGGTGAACAACATCACCACCGGTGCCGGCATCATGTACTTCCACGTGAACTCGACGCTGGCGGATTCGACCGCGAACGGCGTGAATT 

CTATTGCTATCGGTGGCGCGACCAGAACGGACGCGAACAATTCGATTTCGATCGGTACGGGACTGACGCAAGCCTCAAGCAATACAGG 

GGCTATCGCAATTGGTCAGAATGCGAGCATCAATGTATACGGCGCGAATAGTATCGCTATCGGCACAAACAGTGCGACGGGTGGCATT 

GGCGGTGCGATTGCGCTCGGCGAGAACGCCTTTGCAACCGGCGGCAAGATGTTGGCGCTCGGCTCCGGTGCGAGTGCGACGACGGCTA 

ATTCGGTCGCGCTAGGCTCAGGTTCGACGACGACGGCGAATTTGACAGCAGCAGGATATAACCCTGGCAGCGGCACGCTTGCCGGTAC 

GTCACAGGCTACGAATGGCGAAGTGTCGGTGGGCAATGCAGGTGCAGAGCGTCGTATCACTAACGTTGCGGCCGGTTCGGCAGCCACG 

GATGCGGTGAACGTGAGCCAGTTGCAATCGGAAGATGCAAAGGTGAACACGATCAACAACAACGTGAACAACCTGAGCGGCAGTGTCA 

CCMCATCAGCAGCACGGTGAACAACATCACCAACGGTGGTGGCATCAAGTATTTCCACGCGAACTCGACGCAGGCGGATTCGTCGGC 

CACGGGCACGGATGCAGTGGCGATCGGTGGCAATGCCCAGGCGACGGCGGCGAACTCGGTGGCGCTGGGTTTGAACTCGACGAGTAAG 

GGMCGAACGCGATTGCACTGGGCGGGGCCGTGGCTGGGGGGAGCTACGCTTTCGCGGCAGGCAGCCTGGCATTGGCTGCAACTACTG 

GTGACATTGCGCTGGGTTCATCGGCGACGGCTAGCAGCGCTAACAGCAATGCTTACGCAACTGCGCTGGGTACCAACGCGTTGGCAAA 

CGCAACCGATGCAACTGCCATTGGCGAAGGTGCCTCGGCAACAGCAGCATCCTCGGTAGCGTTGGGCGCGAGATCCAAAACGACAGCG 

AACCTGTCGACAGCGGGCTATAACCCGGGTACGGGGACGTTGTCCGGTACTACCCCGACGGGCGAAGTGTCGGTGGGCTCGGCAGGCA 

AGGAACGACGCGTGACGAACGTGGCAGCCGGCTCGGCAGCGACGGATGCGGTGMCGTGAGCCAGTTGATGTCGGAAGACGCAAAGGT 

GAACACGATCAACAACAACGTGAACAACCTGAGCAACAACGTCACGAACATCGCGGGCAATGTCACCAACATCAGCAACACGGTGAAC 

AACATCACCAATGGTGGTGGCATCAAGTACTTCCACGTGAACTCGACGCTGGCGGATTCTTCCGCGGGGGGAACAAATTCTATAGCTA 

TCGGCGGCGGGGCTACTACGGGGAATGTTACTGCAGGAACTTCAGACAATATATCCATCGGCACCAACGCCACAACCPJ\CTATGGCAA 

AAATATTGCCATCGGCGGCAATGCGCAGGCCTTGGGAGGCGCATACGACGGTGGCTACAACACTGCGATTGGTGAGAACGCGATTGCA 

AAGGGAGATGGAGCAGGTGGCTTCGGCGGAGGGGGGTGGGGGCAGACCACGGCCATTGGCGGTGGTTCGCAGGCATTGCATGACAACA 

CCACGGCGGTCGGTTCAGGCGCGATCGCAAATGTGGCTAATGCTACTGCTTTGGGTATGTCTGCCAGCGCGACCGCTGGGAGTGCTAT 

CGCGCTGGGTCAGGGGGCAGTGGCGTCGGCAGCGAACTCGGTGGCACTGGGTTCGGGTTCGACGACGACGNCGAACCTGTCGGCAGCG 

GGTTATAACCCGGGTACGGGTACATTGTCGGGCATCGCTTCGGTAGCCAATGGCGAAGTGTCGGTGGGTGCAGCAGGCAAGGAACGTC 

GCATCACCAACGTGGCAGCCGGCTCGGCAGCCACGGATGCAGTGAACGTGAGCCAGTTGCAATCGGAAGATGCAAAGGTGAACACGAT 

CAACAACAACGTGAACAACCTGAGCGGCAGTGTCACCAACATCAGCAACACGGTGAACAACATCACCAACGGTGGTGGCATCAAGTAT 

TTCCACACGAAATCGACGCTGGCCGATTCGTCGGCGACGGGTACGGATGCAGTGGCGATCGGCGGCAATGCCCAGGCGACGGCGGCGA 

ACTCGGTAGCACTGGGTTCGAACTCGACGACGACAGCGAACCTGTCGGCAGCGGGTTATAACCCGGGCACGGGTGCGTTGTCTGGCAT 

CGCTTCGGCAGCCAACGGTGAAGTGTCGGTGGGTGCAGCAGGCAAGGAACGCCGCATCACTAACGTAGCAGCCGGCTCGGCAGCCACG 

GATGCGGTGAACGTCAGCCAGCTCCAGTCGGAAGACGCAAAGGTGAACACGATCAGCAACAACGTGAACAACCTGAGCGGCAGCGTCA 

CCAACATCAGCAGCACGGTGAACAACATCACCAATGGTGGTGGCATCAAGTACTTCCACACGAACTCGACGCTGGCGGATTCGACCGC 

GAACGGCGTGAATTCTATTGCTATCGGTGGCGCGACCAGAACGGACGCGAACAATTCGATTTCGATCGGTACGGGACTGACGCAAGCC 

TCMGCA2Viiic^(rc:G:;M^cGv;;:-. .re -;v «:v -r;,,:V;:Gr.c: rTATCfu *7 T * cgccgcg aatagtatcc ct atc i r •. r ::; :v\.cr.T:G 

CGACGGGTGGCATTGGCGGTGCGATTGCGCTCGGCGAGAACGCCTTTGCAACCGGCGGCAAGATGTTGGCGCTCGGCTCCGGTGCGAG 

TGCGACGACGGCTAATTCGGTCGCGCTAGGCTCAGGTTCGACGACGACGGCGAATTTGACAGCAGCAGGATACAACCCGGGCAGCGGC 

ACGCTTGCCGGTACGTCACAGGCTACGAATGGCGAAGTGTCGGTGGGCAATGCAGGTGCAGAGCGTCGTATCACTAACGTTGCGGCGG 

GCTCGGCAGCCACGGACGCGGTGAACGTGAGCCAGTTGCAGTCGGAAGATGCGAAGGTGAACACGATCAACAACAACGTGAACAACCT 

GAGCAACAACGTCACAAACATCGCGGGTAACGTCACCAACATCAGCAACACGGTGAACAACATCACCAATGGTGGTGGCATCAAGTAT 

TTCCACACGAAATCGACGCTGGCCGATTCGTCGGCGACGGGCACGGATGCAGTGGCGATCGGCGGCAATGCCCAGGCGACGGCGGCGA 

ACTCGGTAGCACTGGGTTCGAACTCGACGACGACAGCGAACCTGTCGGCAGCGGGCTACAACCCGGGCACGGGTACGCTGTCCGGTAC 

TACCCCGACGGGCGAAGTGTCGGTGGGCTCGGCAGGCAAGGAACGTCGCGTGACGAACGTGGCAGCCGGCTCGGCGGCGACGGATGCG 

GTGAACGTGAGCCAGTTGCAGTCGGCCATCATCGGCAGCACCGCGAACGCGGTCGCCTATGACGACGGCACGAAGGCCACGGTTACGC 

TGAAGGGCGCGAGCGGCACGAAGATCACCAACCTGACGGCGGGCAATCTGAGCGCAACGAGCACGGACGCGGTGAACGGCTCGCAGTT 

GTACGCGACGAACCAGAACGTGTCGAACATCGGTAACACGGTGAACAACATCACCAACGGTGGTGGCATCAAGTATTTCCACGCGAAC 

TCGACGCAGGCGGATTCGTCGGCCACGGGGTCGAACTCGGTCGCGGTGGGCGACAGGGCGTCGTCGCTTGGCGGTTCGTCGGTCGCGA 

TGGGTGACGGCGCGACAGCAGTGGGCGCGGCGAGTATTGCGATTGGTAACAATGCGCAGAACGTGACCGGGTCGAACAATTCAGTTGC 

CATTGGAGGTGATTCGMGGCGGGCGAC^GGTCCGTTTCACTGGGCAACGGTGCAGATACCTCGCTGTCGAGCTGGGGCGTTGCA 

GGTACCMCGCGMCGTTTCTGCTGC^CTGGGCACGGCAATTGGTGCCGGGGCCAACGTGAGCGGCGCGM 
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^•^CGCGGTGGCCTCTGCGACCMCTCGGTGGCGCTG^TTCGAATTCGACGACGACGGCGMCCTGTCGGCTGCGGGTTATAACCC 

^GTACGGGTACGTTGTCGGGCATCGCGTCGGCGGCCAATGGCGAAGTGTCGGTCGGTGCAGCAGGCAAGGAACGTCGCGTGACGAAC 
GTGGCAGCCGGCTCGGCAGCCACGGATGCGGTGAACGTGAGCCAGTTGCAGTCGGAAGATGCGAAGGTGAACACGATCMCAACAACG 
TGAACAACCTGAGCGGCAGTGTCACCAACATCAGCAGCACGGTGAACAACATCACCAACGGTAGCGGCATCAAGTACTTCCACACGAA 

CTCGACGTTGGcrn7\ r r^^ 

; ... t i *r : , <r < m r\ q-r t, • . r ■ ' t ' ( ' ' ' ■ I ' • ' f ■ ■ r. t ■- . -•, • 

;-^.^.i«, ww Ow , wJww*2*w^* J : w^j * ... w . . . . - ... ^ a. ^ o^^ri^Libvi ±uC^"iUuik l iv>'OvUi CJOC* u^ui * *w» ... .>« .... . ^ . ^ ... .... 

CMCACCGTAATCAATGGCGTCAACTCGGTAGCGCTCGGGATCTCGGCGACGACGAGCGGGAGCGGCAGCATGGCCTTTGGCTCGGCG 



CGACGACGGCCAACCTGTCGGCAGCGGGra 

GGTGGGCTCGGCAGGCAAGGAACGTCGCGTGACCAACGTGGCAGCCGGTTCGGCAGCGACGGATGCGGTGAACGTCAGCCAGCTCCAG 

TCGGAAGACGCCAAGGTGAACACGATCAACAATAACGTGAACAACCTGAGCAACAACGTCAGCAACATCGCGGGCAACGTCACCAACA 

TCAGCAACACGGTGAACAACATCACCAACGGTGGCGGCGGCATCAAGTACTTCCACGCGAACTCGACACTCGCCGATTCGTCGGCAAC 

GGGCACGGATGCAGTGGCGATCGGCGGGAATGCCCAGGCGACGGCGGCGAACTCGGTAGCACTGGGTTCAAACTCGACGACGACGGCA 

AACCTGTCGGCAGCGGGCTACAACCCGGGCACGGGTACGCTGTCCGGTACGACCCCGGTGGGCGAAGTGTCGGTGGGCTCGGCAGGCA 

AGGAACGTCGCGTGACAMCGTGGCGGCCGGCTCGGCGGCCACGGATGCGGTGAACGTGAGCCAGTTGCAGTCGGCCATCATCGGCAG 

CACCGCGAATGCGGTCGCCTATGACGACGGCACGAAGGCCACGGTTACGCTGAAGGGCGCGAGCGGTACGAAGATCACCAACCTGACG 

GCAGGTAATCTGAGCGCMCGAGCACGGACGCGGTGAACGGCTCGCAGTTGTATGCGACGAACCAGAACGTGTCGAATGTCGGTAAC?\ 

CGGTCAGTAACCTGAGCAACAACGTCACGAACATCGCGGGTAACGTCACCAACATCAGCAACACGGTGAACAACATCACCAATGGTGG 

TGGCATCAAGTATTTCCACGCGAACTCGACGCTCGCCGATTCGTCGGCGACGGGCACGGATGCAGTGGCGATCGGTGGCAATGCCCAG 

GCGACGGCAGCGAACTCGGTGGCGCTGGGTTCAAACTCGACGACGACGGCGAACCTGTCGGCAGCGGGCTATAACCCTGGCACGGGTG 

CGTTGTCCGCTACGACCCCGGTGGGCGAAGTGTCGGTGGGCTCGGCAGGCAAGGAACGTCGCGTGACGAACGTGGCAGCCGGCTCGGC 

AGCGACGGATGCAGTGAACGTCAGCCAGTTGATGTCCGAAGATGCCAAGGTGAACACGATCAACAACAACGTGAACAACCTGAGCAAC 

AACGTCAGCAACATCGCGGGTAACGTCACCAACATCAGCAATACGGTGAACAACATCACCAACGGTGGCAGCGGCATCAAGTACTTCC 

ACGCGAACTCGACGCTGGCGGATTCGTCGGCAACGGGCGTTGACGCAGTGGCGATCGGCGGCAATGCCCAGGCGACGGCAGCGAACTC 

GGTAGCACTGGGTTCGAACTCGACGACGACAGCGAACCTGTCGGCAGCGGGTTATAACCCGGGCACGGGTGCGTTGTCTGGCATCGCT 

TCGGCAGCCAACGGTGAAGTGTCGGTGGGTGCAGCAGGCAAGGAACGCCGCATCACTAACGTAGCAGCCGGCTCGGCAGCCACGGATG 

CGGTGAACGTCAGCCAGCTCCAGTCGGAAGACGCGAAGGTGAACACGATCAACAACAACGTGAACAACCTGAGCAACAACGTCAGCAA 

CATCGCGGGCAACGTCACCAACATCAGCAATACGGTGAACAACATCACCAACGGTGGCAGCGGCATCAAGTACTTCCACGCG2VACTCG 

ACACTCGCCGATTCGTCGGCAACGGGCACGGATGCAGTGGCGATCGGTGGGAATGCATCGGCATCGGCGGCAAACTCGGTGGCGCTGG 

GTTCGAACTCGACGACGACGGCGAACCTGTCGGCAGCGGGATACAACCCGGGTTCGGCAGCACTGTCGGGCACGGCCTCGGCAGCCAA 

CGGCGAAGTGTCGGTCGGTGCAGCAGGCAAGGAACGCCGCATCACGAACGTAGCAGCCGGCTCGGCAGCCACGGATGCGGTGAACGTG 

AGCCAGCTCCAGTCGG2\AGACGCGAAGGTGAACGCTGAAGGCGCGGCCACTGCGGCAGCGCTGGGCGGCGGTTCGACCTACAACACGA 

CGACGGGTGCGATCACCAGTCCGACGTACATCGCAGGCGGCAAGACGTTCAACAATGTTGGCGATGTAGTCACGAACATCGACGGCCG 

TGTTACGCAGAACTCGACGGACATCACGAACCTGACTACGACCATCGACAACGGCACGATCGGTCTGGTGCAGCAGGCTACGCCGACG 

AGCACGATTACGGTCGCGAAGGACACGGGCGGCGCGACGGTGGATTTCCGGGGCACGGGCAATGCAACTCGCACGTTGACGGGCATCA 

CGGCGGGTGAGTTGAGCGCAACGAGCACGGATGCGGTGAACGGCTCGCAGTTGTACGCGACGAACCAGAACGTGTCGAACATTGACAA 

CACGGTCAGTAACCTGAGCAACAACGTCACGAATATCGCGGGCAATGTCACCAACATCAGCAACACGGTGAATAACATCACCAACGGT 

GGTGGCGGCATCAAGTACTTCCACGCGAACTCGACGCTGGCGGATTCGTCGGCAACGGGCGTTGACGCAGTGGCGATCGGCGGCAATG 

CCCAGGCGACGGCAGCGM.CTCGG m AGO^C^ 
GGGCACGTTGrcVGGC/A'^CGC^ 

GCCGGCTCGGCAGCCACGGACGCGGTGAACGTGAGCCAGTTGCAATCGGAAGATGCCAAGGTGAACACGATCAACAACAACGTGAACA 
ACCTGAGCAACAACGTCAGCAACATCGCGGGCAATGTCACGAACATCAGCAACACGGTGAACAACATCACCAACGGTGGCGGCGGCAT 
CAAGTACTTCCACGCGAACTCGACGCTGGCGGATTCGTCGGCGACGGGCACCAACAGCCTGGCGGCCGGACCGGCAGCGGTGGCATCC 
GCCACTGACGCAGTCGCACTGGGCAACGGCGCGAAGGCAACCAACGCCGGCGCGGTGGCACTCGGCGCAGGTTCGACGACAACGACCG 
CGGTGGCAACCTCCGGCACGACGATTGGCGGCATCACCTACACGTTCGCAGGCGTCGCTCCGTCGAGCACGGTGAGCGTGGGTGCGGC 
GGGTAGCGAGCGCACGATCACGAACGTGGCGGCTGGCCGCCTGAGCGCGACGAGCACGGACGCGGTCAACGGCAGCGAACTGTTTGCA 
ACCAACCAGCAGGTGACGCGAAACACCGCGGACATCACCAACCTGACGAACAACATGAACATCGGTTCGGTGGGTCTGGTGCAGCAGG 
ACGCGACGACACGCACCATCACGGTCGCGAAGGCCACCGACGGTACGCGGGTCGACTTCACGGGCACGGGGGGCGCGCGTCAATTGAC 
GGGCGTGGCCGCAGGCGCAGTGAACGCGACGAGCGTGGATGCGGTGAACGGTTCGCAGCTCTACGGTGTGTCGCAGAGCGTGGCGGAT 
GCGATCGGCGGTGGCTCGACCGTGAACACGGATGGCTCCATCTCGGCCCCGACCTACGTTGTGGACGGCACGACCGTCCACAATGCGG 
GCGACGCGATCAGCAACCTCGACAACCGTGTGACGCAGAACACCACCGACATCAGCACGATCAACAACACGCTGAACAGCATCACCAC 
GGGTGCGGGCGTCAAGTACGTGCATGTG^CTCGACCCTGGC^^ 

GCGCAATCGCAGGCGGCGAACTCGGTGGCATTGGGTTCGAACTCGGTGGCGGATCGTGCCAACACGGTGTCGGTGGGCGCGGCTGGTG 
CGGAGCGTCAGATCACCMCGTGGCGGCCG^TACGGCGGACACGGA 

TACGGATGGCACGACC7UVCGCCGCCGTCACGTACGACCACAACGCGGACGGCTCGGCCAACTACAACAGCGTCACGATGGGTAACGGT 
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^•rgcgggcggcacgacgatccacaacgtggcggcaggctcggctgccgacgacgcggtcaacgtcagccagatgaacgcggcgatct 

^agcgtctcgaacatcatcggctcggcgggtaatccgctgttcacagccgacggcaaccgtgacacggaagccgcggtcgccagcgg 
cacgcatgccacggcaatgggcgcgaacgcgaaggcaagcgcggccaattcggtggcgctgggtgcgaactcggtggcggaccgtgag 
mtacggtgtcggtgggctcggcgggcmcgaacgccaggtcaccmtgtcgcggccggtacggccaccaccgacgccgtcaacgtcg 

GCCAGTTGMCCAGGCGATCGGCGCATCGATCGGCM 

GGTGCAGMCGGCGTGMCCAGGTTGCGMf^' 1 ^^- ■ : ' ■ ■ :'-^W- T ^PATTCCGGACGTCGAT 

AGAACCTGMGATGAAGGCCGGCGCGGGTACGAGCTCGCAAGGCACGACGGTGGGCCTGGGTGCTTCCTACCAGTGGTAA 
GSGGGG 

SEQ ID NO: 30 {^J,C) 

ATGCTGATCCAGCAGAACAGCGAAGTCATCAATCAGCTTGCCGGTAACACCAGTGAAACCTACATCGAAGAAAATGGTGCAAGTATTA 
ACTATGTGCGT ACCAATG ACACCGGT T T AACCT TC ACCG ATGCCAGCGCAGCAGGTATTGGCTCTACCGCTGTGGGGT AT AACACTGT 
TGCCAAAGGCGATAACAGCGTGGCCATGGGTTATAACTCTTTTGCCGAAGGCCATAGTAGCGTGGCCATCGGTCAGGGCAGCTACAGC 
GGCGTTGAGACGAGTATTGCGCTGGGTAGCGAATCCGTCTCCAGCCGCGTGATTGTTAAAGGTTCTCGTAACACGAGCGTATCGGAAG 
AAGGTGTTGTGATTGGTTATGACACCACGGATGGCGAACTGCTTGGCGCATTGTCGATCGGTGATGATGGTAAATATCGTCAAATCAT 
CAACGTCGCGGATGGTTCTGMGCCCATGACGCTGTTACTGTTCGCCAGTTGCAAAACGCCATTGGTGCAGTAGCAACCACACCAACC 
AAATACTATCACGCTAACTCMCGGCTGAAG^ 

GTAATGCGGGTATTGGTATCGGCCTGA^.CACTTTAGTTCTGGCTGATGCGATCAACGGTATTGCTATCGGTTCTAACGCACGCGC2\AA 
TCATGCCGACAGCATTGCTATGGGTMTGGTTCTCAGACTACTCGTGGTGCGCAGACCAACTACACTGCCTACAACATGGATGCACCG 
CAGAACTCTGTGGGTGAGTTCTCTGTCGGCAGTGAAGACGGTCAACGTCAGATCACCAACGTCGCAGCGGGGTCGGCGGATACCGATG 
CGGTTAACGTGGGTCAGTTGAMGTAACGGACGCGCAGGTTTCCCAGAATACCCAGAGCATTACTAACCTGAACACTCAGGTCACTAA 
TCTGGATACTCGCGTGACCAATATAGAAAACGGCATTGGCGATATCGTAACCACCGGTAGCACCAAATACTTCAAGACCAACACCGAT 
GGCGTAGATGCCAACGCGCAGGGTAAAGACAGTGTTGCGATTGGTTCTGGTTCCATTGCTGCCGCTGACAACAGCGTCGCGCTGGGTA 
CAGGCTCCGTGGCCAACGAAGAAAATACCATCTCTGTGGGTTCTTCTACCAACCAGCGCCGTATCACCAACGTTGCTGCCGGTGTTAA 
TGCCACCGATGCGGTTAACGTTTCGCAACTGAAGTCTTCTGAAGCAGGCGGCGTTCGCTACGACACCAAAGCTGATGGCTCTATCGAC 
TACAGCMCATCACTCTCGGTGG^GG m MCGGC^^ 

ACTACGCGCAGTTGAAGCAAAGTGTGCAGGAAACGAAGCAATACACCGATCAGCGCATGGTTGAGATGGATAACAAACTGTCCAAAAC 
TGAAAGCAAGCTGAGTGGTGGTATCGCTTCTGCAATGGCGATGACCGGTCTGCCGCAGGCTTACACGCCGGGTGCCAGCATGGCGTCT 
ATTGGTGGCGGTACTTACAACGGTGAATCGGCTGTTGCTTTAGGTGTGTCGATGGTGAGCGCCAATGGTCGTTGGGTCTACAAATTAC 
AAGGTAGTACCAATAGCCAGGGTGAATACTCCGCCGCACTCGGTGCCGGTATTCAGTGGTAA 

SEQ ID NO: 31 (Shigella) 

ATGAACAAAATATTTAAAGTTATCTGGAACCCTGCGACAGGGAATTATACTGTTACCAGCGAAACGGCAAAAAGCCGTGGCAAGAAAT 
CTGGGCGCAGTAAGCTGTTAATTTCTGCGCTGGTTGCGGGTGGAATGTTGTCGTCGTTTGGGGCATTGGCGAATGCCGGGAATGACAA 
CGGTCAGGGTGTTGATTACGGTAGTGGATCAGCTGGCGACGGCTGGGTTGCTATAGGCAAAGGGGCGAAAGCAAATACTTTTATGAAC 
ACCAGTGGTTCCAGTACTGCTGTGGGTTATGACGCTATAGCTGAAGGCCAATATAGCTCTGCCATCGGGTCAAAAACCCATGCGATTG 
GTGGTGCATCAATGGCCTTTGGGGTTAGTGCAATATCAGAAGGCGATAGAAGTATAGCACTGGGTGCCTCTTCGTATTCATTGGGCCA 
ATACTCAATGGCCCTCGGCCGTTATTCAAAAGCATTGGGTAAATTGTCTATTGCTATGGGGGACTCTTCCAAAGCGGAAGGAGCAAAC 
GCCATTGCCCTGGGGCGTAGCAGTGTAGCTAGCGGTACTGACAGCCTCGCATTTGGCAGACAATCACTTGCCAGCGCAGCGAACGCTA 

T r i*r~ , .(T > ~i\'VT\f~'f~' r "VT" 1 * V - ' A O v « fT* ' ' ' ' *r ' ■ ■' ■■" ' " ' p r *f~< 1" 7 * . •< ~ l *■• . y; ,. ■ t t ■ ■ • . •• • nr-»r »«• > /-> "■</'i7\ r\ ■ \ t> r-.r*t~< * \ r"7i" v f ■' M "n rpfr" /-»t> *n n 
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GGGGTTCGGAAGCCTTGCCGATAAAGTCMTACTATCGC^ 

CAGGGCAACAAAGCTGATGGCGTGGATGCCATCGCTCTGGGTAATGGTAGCCAGTCGAGAGGCTTAAACACCATTGCCTTAGGCACAG 
CCAGTAATGCAACTGGTGATAAGAGTCTTGCGCTTGGTAGTAATAGCAGTGCCAACGGTATTAACTCTGTCGCGCTGGGCGCAGATTC 
CATTGCGGATTTAGACAATACCGTCTCTGTCGGCAATAGTTCATTAAAACGCAAGATCGTTAATGTGAAAAATGGCGCGATCAAGTCT 
GACAGTTACGATGCCATTAATGGTTCACAGCTTTATGCCATTAGCGACTCGGTAGCAAAAAAGGCTTGGAGGAGGGGCTGCAGTAGAT 
GTTGATGACGGTACTGTTACAGCACCAACCTACAATTTAAAAAATGGTAGCAAAAATAACGTAGGGGCTGCGCTCGCTGTACTTGATG 
AAAACACCCTGCAATGGGACCAAACCAAAGGCAAATACAGCGCTGCTCATGGTACTAGTAGCCCAACTGCCAGCGTAATCACCGATGT 
TGCGGATGGCACGATTTCAGCCTCCAGTAAGGATGCGGTTAACGGTTCCCAACTGAAAGCTACCAATGACGATGTCGAAGCCAACACC 
ACCAATATCACCAACCTGACGGATTCCGTTGGTGACCTTCAGGCTGATGCCCTGCTCTGGAACGAAACTAAAAAGGCATTCAGTGCAG 
CTCACGGCCAGGATACCACCAGCAAAATCACCAACGTTAAAGATGCCGACCTGACGGCTGACAGCACTGATGCTGTTAACGGCTCTCA 
GCTGAZ^AACCACCAACGATGCTGTGGCGACGAATACCACCAATATCGCCAATAACACTTCCAATATTGCCACTAACACCACCAACATC 
TCTAACCTGACTGAGACGGTGACTAATCTTGGTGAGGATGCGCTGAMTGGGATAAGGACMTGGTGTATTCACGGCAGCTCATGGCA 
CCGAGACCACCAGCAAAATMCCMCGTTAAAGATGGCGACCTGACGACTGGCAGCACCGATGCCGTTMCGGCTCTCAGCTGAAAAC 
CACCMCGATGCCGTGGCGACGMTACCACCAATATCGCC^ 

GGTGAGGATGCGCTGAAATGGGATAAGGACAATGGTGTCTTCACTGCAGCTCATGGCAACAATACCGCCAGCAAAATCACCAATATCC 
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^KACGGCACAGTCACTGCMCCAGTTCCGATGCCATTMCGGTAGCCAGCTTTATGACTTMGCAGCAATATCGCCACCTACTTCGG 
^RcMTGCTTCTGTGMTACTGACGGTGTGTTTACCGGTCCMCCTACAAAATCGGTGAAACAAATTATTATAACGTCGGCGATGCA 
CTGGCTGCGATTAACTCCTCATTTAGCACGTCTCTCGGCGATGCTCTGCTTTGGGATGCCACCGCAGGTAAATTCAGTGCCAAACACG 

GTACTAATGGTGACGCAAGCGTGATCACTGATGTCGCAGATGGTGAAATTTCAGACTCCAGTTCTGACGCAGTAAACGGCTCACAACT 

cc;; .M'-zcc r : ; ; ; v. \ ^.^ 

iv • r - • :■ •'*■■■< ■ r ( X : ( ; C ;t 0 ?\ATGCT ATCGATACCACTCCTGACG ACGCTCTGCTCTGGGATGCG C ' 

- . \. . .. . . . ;•_:(. J 'iGTGTAATCACTAACGTCGCTAACGG'.CC. C: . 

TGCCAGCAGCGACGCGATTAACGGCTCACAACTCTATACCACCAATAAGTACATCGCTGATGCGCTGGGTGGTGACGCAGAAGTCAAC 
GCTGACGGCACCATCACCGCACCGACTTACACCATTGCGAACGCCGAGTACAACAACGTCGGTGACGCCCTGGATGCGCTTGATGATA 
iYCGCCCflGCTGYGGGATMGACT 

CGCT AATGGCAGTAT TAGTGAGGACAGT ACCG ATGCAGTGAACGGT TCTCAGT TGAATGCGACGAATATGATGATTGAGCAGAACACC 

CAAATTATCAATCAGCTCGCTGGTAACACCGACGCAACCTATATCGAAGAAAATGGCGCTGGTATCAACTACGTTCGTACTAACGACA 

ACGATTTAGCCTTTAACGATGCAAGCGCCTCTGGTGTCGGCGCTACAGCTGTAGGTTATAACGCTGTCGCGTCTGGTGCCAGCAGCGT 

AGCCATTGGTCAGAACAGCAGCAGCACCGTTGATACCGGTATTGCGCTGGGTAGCAGCTCCGTTTCCAGCCGTGTGATTGCCAAAGGT 

TCTCGTGACACTAGCGTAACGGAAAATGGCGTGGTGATTGGTTATGACACCACTGACGGCGAACTGCTAGGTGCATTGTCAATTGGTG 

ATGACGGTAAATACCGCCAAATCATCAACGTAGCTGATGGTTCAGAAGCCCATGACGCCGTTACGGTTCGCCAGTTGCAGAACGCTAT 

TGGAGCGGTCGCCACTACGCCAACCAAGTACTTTCACGCCAACTCAACGGCAGAAGACTCACTGGCCGTTGGTGAAGACTCGCTGGCA 

ATGGGTGCGAAAAC T GTCGTTAAT G G' j.\A >\TG C7 j GGTAT TGGTATCGGTTTGAAC ACTCTGGTT CTGGCTGATGCGATCAACCGCA'l' 'i! C 

CTATCGGCAGTAACGCACGCGCAP i ACCArCC^ A CLG JiV^CGC^VA'GGGTMCGGTTCTCAGACCACCCGTGGTGCACAGACTGGCT.n 

CACCGCCTACAACATGGACGCACCGCAGAACTCTGTAGGTGAGTTTTCTGTCGGTAGTGAAGACGGTCAACGTCAGATCACAAACGTC 

GCAGCTGGTTCAGCGGATACCGATGCGGTTAACGTGGGTCAGTTGAAAGTCACTGATGAGCGCGTAGCGCAAAATACCCAGAGCATTA 

CTAACCTGAACAATCAGGTCACTAATCTGGATACTCGCGTTACTAATATCGAAAACGGTATTGGCGACATTGTCACCACCGGTAGCAC 

CAAGTACTTCAAGACCAACACCGATGGCGTAGATGCCAACGCCCAGGGTAAAGATAGCGTTGCTATTGGTTCTGGTTCCATTGCTGCC 

GCTGACAACAGCGTCGCACTGGGTACCGGTTCCGTTGCAGAGGAAGAAAATACAATCTCTGTAGGTTCTTCTACTAACCAACGCCGGA 

TCACCAACGTAGCTGCCAGTGTTAATGCCACCGATGCGGTTAACGTGTCGCAGCTGAAATCTTCTGAAGCAGGCGGAGTGCGCTACGA 

CACCAAAGCTGATGGTTCTATCGACTATAGCAATATCACCCTCGGTGGTGGCAACGGCAGTACGACTCGTATCAGCAACGTCTCCGCT 

GG CG i CAACAACAACGACGCGGT GAAC r AC G CG Cz * i GAAGCAAAGCGCGCAGGAAACGAAGCAATACACCGATCAGCGGATGGTT G 

AGATGGATAACAAACTGTCTAAAACTGAAAGCAAGTTGAGCGGTGGTATCGCTTCTGCAATGGCAATGACCGGTCTGCCGCAGGCTTA 

TACACCGGGTGCCAGCATGGCTTCTATTGGTGGCGGTACTTACAACGGTGAATCGGCAGTTGCTTTAGGTGTATCGATGGTGAGCGCC 

AATGGTCGTTGGGTCTACAAATTACAAGGTAGTACCAATAGCCAGGGTGAATACTCCGCCGCACTCGGTGCCGGTATTCAGTGGTAA 

SEQIDNO: 32 (Brucella melitensis) 

ATGTCTTTTTTTAAAAAAAATATATCGATAACAGCCATGGGTGGCTTGATGTTGTCGCTTGCAGTCGACGCTGCGAAGGCGGAGGAAA 
ATGTTTCGCAGGTGAAACTTCCTCCTGTTTTTGTTTTTGAGCTTGTTGAAAACCAGGGGCTGGCCMTATTGCTCTGATTCGACCTCG 
AGTGATAGCGCCAGACAACAACCTACGTCCTGGGGGCATTGTGTCTGGGATCGCTGGTCTTTTGACTCTCGGGCAGGAGAACCGCAAC 
CTAATTAGTGAAAATAGGCAAGTTATTAATAACAATACGACTGCCATTGGCCAGAATCGTACCTCGATTAGTACCAATGCCAAGGGCG 
TTGCCGACAATAGGGCTGCTATCAGACAGAATAGTGCTGCGATTTCCGCTC.TTGGCCAGCGAGTTGACGGGTTGCAAGGTCAGATCAA 
CAGCGCCCGGAAAGAGGCACGTGCCGGAGCTGCAAATGCCGCTGCTCTTTCGGGGCTAAGATATGATAACCGTCCAGGCAAGGTGTCA 
ATCGCTACCGGTGTTGGCGGCTTTAAAGGTAGCACCGCTTTGGCTGCTGGTATCGGTTACACAAGCAAGAACGAAAACGCTCGTTATA 
ACGTGAGTGTAGCTTACAACGAAGCTGGCACATCCTGGAATGCAGGTGCAAGCTTTACGCTTAACTAA 

- -r . , , . , 

X*-.- >• M J. . \ J.- , . . 

' - V ✓ 

ATGTCTTTTTTTAAAAAAAATATATCGATAACAGCCATGGGTGGCTTGATGTTGTCGCTTGCAGTCGACGCTGCGAAGGCGGAGGAAA 
ATGTTTCGCAGGTGAAACTTCCTCCTGTTTTTGTTTTTGAGCTTGTTGAAAACCAGGGGCTGGCCAATATTGCTCTGATTCGACCTCG 
AGTGATAGCGCCAGACAACAACCTACGTCCTGGGGGCATCGTGTCTGGGATCGCTGGTCTTCTGACTCTCGGGCAGGAGAACCGCAAC 
CT AATTAGTGAAAAT AGGCAAGTTAT TAAT AACAATACGACTGCCAT TGGCCAGAACAGCGATCGGAT CGACGCCAATGCCAAAGGT G 
TTGCCGACAATAGGGCTGCCATTGGCCAGAACAGCGGCCGGATCGACGCCAATGCCAAGGGCGTTGCCGACAACAAGGCTGCCATTGG 
CCGAAACAGCGGTCGGATCGACGCCAATGCCAAGGGCGTTGCCGACAACAAGACTGCCATTGGTCGAAACAGTGGTCGGATCGATACC 
AATGCCAAAGGTGTTGCCGACAATAGGGCTGCCATCAGCCAAAACCGCGGTCGCATCAATGCTAACGCAGCAGGTGTTGCTAGCAATA 
GGGCTGCTATCAGACAGAATAGTGCTGCGATTTCCGCTCTTGGCCAGCGAGTTGACGGGTTGCAAGGTCAGATCAACAGCGCCCGGAA 
AGAGGCACGTGCCGGAGCTGCAAATGCCGCTGCTCTTTCGGGGCTAAGATATGATAACCGTCCAGGCAAGGTGTCAATCGCTACCGGT 
GTTGGCGGCTTTAAAGGTAGCACCGCTTTGGCTGCTGGTATCGGTTACACAAGCAAGAACGAAAACGCTCGTTATAACGTGAGTGTAG 
CTTACAACGAAGCTGGCACATCCTGGAATGCAGGTGCAAGCTTTACGCTTAACTAA 
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^p(? ID NO: 34 (Ralston ia solanacearum) 

GTGGTTTTCAGCGCCATGCCGCMTACGCTTGCGCAGAMTGTTGCTGCMAACGATCCGGGMCGAATTGTGGMGCGTGGGTGATG 

CATATGCCTGGGCGCGAGGCGATGGGTATTCGGGTTGTMGGTCGGTTACGMGCCGCMAAAATTTGGCAAAGGGCACAGCATTCGG 

CMTAGCCTGGGTCMCTATCGCCTGGTACGMCATTCTCGTCTACGGGTCGACGCTCCGTGCTGGTATGAATGACGAAGTGACACCC 

C T CGAT T CGATG AAT AT CGG T Go CCA i C i C G ACGXi G G G G ■' \G C Gi V ,G CG G CT T T C ACG GCG GTGTC GAT AT G?,AT A AT T C C G CCAT C A 

AAAACCTTGCTGACGGCACGCTGTCGGCCi'iCG. J ". / :" ' ' '^ n ^ r ' r -^ "^"a^.ctCACCA^CACGAACATCACCAA 
■,' • i * "" , "'i(?i\"i ri ur , pi\ r pr , ii a< ■ • ' ■ ' 1 * ■ - •< • ->■■■•. i - t- r«- •'-""•r , r*TN 7> r*r»i\p 

CTGGACGGTGACGCGGTGGACTTCAGCGGCAAGAAGCTGAGTGACAGCACGACGTTTTCGCGCAAGCTGACGGGTGTGGCGGAGGGGA 
CGTTGTCGGCGACGAGCACGGATGCGGTGAGCGGCAAGCAACTCTATACGACCAACCAGAACCTGAGCACGACCAACCAGAATCTGGC 
GGACACGMCMGAGCCTGGCCGAGACGAACA/\Gai\CGTGTCGGCGACCACG 

ATCAGCGGCGGCTCGGCGGGTCTGGTGCAGCAATCGGCCGCGGGCAAGGACATCACGGTGGCCAAGGACCTGGACGGTGAGGCGGTGG 

ACTTCAGCGGCAAGAAGCTGAGCGACAGCACGACGTTCTCGCGCAAGCTGACGGGTGTGGCGGAGGGGACGCTGTCGGCGACGAGCAC 

GGATGCGGTGAGCGGCAAGCAGCTCTATACGACGAACCAGAATCTGGCGAGTACCAACAAGGACCTGGCCAATACCAACACGCGCCTG 

ACGACGGCCGAGGGCAATCTGTCGTCGAACACGACGAGCATCACGAACCTGCAGAACACCATCAAGAACATCAGCGGCGGCTCGGCGG 

GTCTGGTGCAGCAATCGGCTGCGGGCAAGGACATCACCGTGGCCAAGGACCTGGACGGTGACGCGGTGGACTTCAGCGGCAAGAAGCT 

GAGCGACAGCACGACGTTCTCGCGCAAGCTGACGGGTGTTGCGGAGGGGACGTTGTCGGCGACGAGTACCGATGCGGTGAGCGGCAGG 

CAGCTCTATACGACCAACCAGAACCTGAGCACGACGAACCAGAATCTGGCGGACACGAACAAGAGCCTGGCCGAGACGAACAAGAACG 

TGTCGGCGACCACGACCMCATCACGAACCiGCAGi^iC^VCGG'JG^.iVC / : f.'ATC -.V •VU^GC,GO r,i TCGGCGGGTC r CC.G m GC^GCAATCGGC 

AGCGGGCAAGGACAT C AC AGTGGCCAAGG ACCTGG/iCGGTGACGCG G±G GAG a. \ G^G G GG Cr ,i-_Gi ulGCT Gi • ' ( ^\C; i£C2YCGACGTTC 

TCGCGCAAGCTGACGGGTGTGGCGGAGGGGACGCTGTCGGCGACGAGCACGGATGCGGTGAGCGGCAAGCAGCTCTATACGACCAACC 

AGAACCTGAGCACGACCAACCAGAATCTGGCGGACACGAACAAGAGCCTGGCCGAGACGAACAAGAACGTGTCGGCGACCACGACCAA 

CATCACGAACCTGCAGAACACGGTGAACAACATCAGCAGCGGTTCGGCGGGTCTGGTGCAGCAGTCGGCCGCGGGCAAGGACATCACG 

GTGGCCAAGAACCTGGACGGTGACGCGGTGGACTTCAGCGGCAAGAAGCTGAGTGACAGCACGACGTTTTCGCGCAAGCTGACGGGTG 

TGGCGGAGGGGACGCTGTCGGCGACGAGCACGGATGCGGTGAGCGGCAAGCAGCTCTATACGACCAACCAGAATCTGGCGAGTACCAA 

CAAGGACCTGGCCAATACCAACACGCGCCTGACGACGGCCGAGGGCAATCTGTCGTCGAACACGACGAGCATCACGAACCTGCAGAAC 

ACCATCAAGAACATCAGCGGCGGCTCGGCGGGTCTGGTGCAGCAATCGGCTGCGGGCAAGGACATCACCGTGGCCAAGGACCTGGACG 

GTGACGCGGTGGACTT CAGCGGCAAGAACCT GAGC G; ;C AGCACG. \CG l T CI 1 CG CG CAAGCT G ACGGGTGTT GCGG AGGGGACGTTGTC 

GGCGACGAGTACCGATGCGGTGAGCGGCAAGCAGCTCTATACGACCAACCAGAACCTGAGCACGACGAACCAGAATCTGGCGGACACG 

AACAAGAGCCTGGCCAAGACGAACAACAACGTGTCGGCGACCACGACCAACATCACGAACCTGCAGAACACGGTGAACAACATCAGCA 

GCGGTTCGGCGGGTCTGGTGCAGCAGTCGGCAGCGGGCAAGGACATTACGGTGGCCAAGGACCTGGACGGTGACGCGGTGGACTTCAG 

CGGCAAGAAGCTGAGCGACAGCACGACGTTCTCGCGCAAGCTGACGGGTGTGGCGGAGGGGACGCTGTCGGCGACGAGCACGGATGCG 

GTGAGCGGCAAGCAGCTCTATGCGACGAACCAGAATGTGTCGAAGTTGTCGGCGAACGTCACGGACGTGAGTGATAGCGTCACCAACA 

TCAAGAACACGATGAACACCATCGTGAACGGCGGCGGGCTCAAGTACTTCCACGCGAACTCGACGCTGGACGATGCGCAGGCGATGGG 

CCTCGAGTCGATCGCGTTCGGCGGCGCAGCCGTCGCGGCCGGTATGAACTCGATGGCGATGGGCGGCAATGCCCGGGCGGTGGCGGGC 

AACGCTGTGGCCTTGGGCGCGGGTTCGGTGGCGGACCGCGCGAACACGGTGTCGGTGGGCTCGGCGGGCAAGGAGCGCCAGATCACCA 

ACGTGGCGGCCGGTACGGCGGACACGGATGCCGTGAACGTTGCCCAGCTGAAGGCGGCCGGCATCATCMCGGCAGCGGCAGGACCAA 

CGCCACGGTGACGTACGGCACCAACGCAGACGGCTCGGCGGACTACGGCAACGTGACGCTGGGCGGCGGCAACGCGCCGGCCGGCACG 

GCGATCCACAACGTCGCGGCCGGCACGGCCGAGACCGACGCGGTGAACGTCAGGCAGATGAACGCGGCCATTGCCAGCGTGCAGMGG 

TGAGCAACACCAACGACCCGATGTTCGCGGCGGATGGCGACCGCGCTGTCAAGCGCGCGAGCGCCAAGGGCACGCATGCCACGGCGAT 

GGGTGCCGCGGCCAGCGCGGGCGGCGACC^GTCGGTCGCGACGGGCC^CA^CGCGCAGTCGGGCGGCGACAGCTCGGTCGCGATGGGC 

GCGAATGCGAAGGCGACGGCGMTC2*' f iGGGGi;.:G;:CG.:v.JGG-:G^ G I^Li;';^! /:G ^'JGGAilG:^.^ v^v GGOI GGGCTCGG 

CGGGCAGCGAGCGCCAGATCACCAATGTTGCGGCCCGTGTGCAGGGTACCGATGCGGTCAACGTGAGCCAGCTGAGCCAGGCGGTCTA 

TGCGGCCGTCGGCGATCTGCCGGCGGGCACGACGGCCAGGCAGTACACGGATGAGCAGATCGGCATGGTGCGGCAGGGGATCAGCCAG 

GTGGCGCGCGGCGCTTACAGCGGTATCGCGGCGGCGACCGCGCTGACGATGATTCCGGACGTCGATCAGGGCAAGTCGATCGCGATCG 

GTATCGGCAGCGCGACCTACAAGGGCTATCAGGCGGTTGCGCTGGGCGCCTCGGCACGCATCTCGCACAACCTGAAGGCCAAGATGGG 

CGTGGGCTACAGCAGCGAAGGCACGACGGTCGGCATGGGCGCGTCGTATCAGTGGTAA 

SEQ ID NO: 35 (Sinorhizobium meliloti) 

GTGGCGCTG^ACGCCAATCGGTCTCGGCCGGCAGCGGTTCACTTGCATTCGGTAACGGTAGTTACGCGAATTCAAACGGATCGGTTG 

CCATCGGGCAGTCTGCATACGCGGCCAATGTTCGGGCAATCGCAATCGGTGGAGATGATGCCTTCGCTTGGCGAGAGGCCGAGCAGAC 

CAAGGCCGGGGGCTCCCAGTCGATCGCCATGGGCGTGCGTGCGCGTACGAAATCGCTGGTAGTCGACGACCCGGATACGGTGGCCAAC 

GAAGCGGACCCGGGCGGCGCGTCTGATGCCATTGCAATCGGTACGGATGCACAGGCCAATGGCGACCGGTCTCTCGCCATCGGCAGAC 

AGMTCAGGCCGGC^TGAGCAATCCATCGGCATCGG^ 

CGTGGCCAGCGGCGTVGCMTCCCTAAGXJCTCGGCGCCGG^^ 

GCCGGCGGTCTCCGGTCGATCGCTTTCGGCGTGCGCGCGAGTACGAAAGAAGCCMTCTAGATATTCCGGATGACGTGGCCGCGATCG 
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KCCATCGCTATCGGTACCMTACCMGGCCMCGGCGACCGGTCCGTCAGCATCGGMCGGGCAGTCAAGCCAGCAGCGGAGCCGT 
TATTGGCGATGCAGCCAAGGCTGTGGGTGACAAATCCGTCAGTATCGGTACCGAAAGCTGGGCCGATGGCGACGAATCGGTCAGC 
ATCGGCCTCGTCMCAACGCCGGGTTTGAAGGGMTGACCGAATCAMGGCGGGCAAACCTCTGTCAGCCTGGGAGCCTTCAATCAGT 

CGCCGGGCMCGAGGCCftTTGCTATCGGTC^ 

GGCCG CLGax CCG GCGCAGGCGGACGGCGGTGCGCGCGACGCCGTCGCT ATAGGCACGGAx G CG CTG GC C AACG^.C GaCC G C CC A7 ' C 
AGCK,v.,.; ... • • ■ ' 'C" > T i 'IT' CCT CCCTG.AACGATT CCATCACC AT C:G G'j'.: VC CCGi\G CCACGTCGGGATCGGCGGGCGATATCATGA 
TCGGCACAGGiTUiGbG^^^ 

AAACATAGCGATCGGCGATTCGGCGGGCGGTTCCCGGGAAGGCGATAACAACGTCGCCATAGGCACCAATGCGGGAATCCAGTTTTCC 

GAGAGCGAACATGAGACCGCCGTGCGCGCCGACCTCGTGGTCAGTGACGCGGTGAGCATCGGCMTGAGGCGCTGGCGAGCGCCGATG 

AAGCCATCGCAATCGGCACCGGCGCCGTGGCTTCCGGTTTGAAGTCCATCAGCATCGGCGTCGGAAATACCGTCAGCGGCGCTTCGAG 

CGGCGCCATCGGCGATCCGACCGATATCACCGGTACCGGCTCCTACTCGCTGGGCAACGACAACACCATCGCCGCCGACAACGCCGGC 

ACCTTCGGCAACGACAACACATTGGCGGATGCCGCCGATGGCAGCCGCGTCATCGGCAACGGCAACAATATCGATGTGTCCGATGCCT 

TCGTGCTCGGCAATGGTGCCGACGTCACCGAAGTCGGCGGCGTGGCGCTGGGTTCCGGCTCGGTTTCGGATACGGGTGCCGACGTGGC 

GGGCTATGTGCCGGGCGGGGCCTCGACGGCCGACCAAMCGCCATCGAGGCGACGCAGAGCACGCGCGGCGCGGTCGCCGTCGGCAAT 

CCGGACGCCGAAACCGGCGTCTACCGCCAGATCACAGGTGTCGCCGCCGGCACGGCCGACTCCGATGCCGCCAACGTGGCCCAGCTCA 

AATCGGTCGAGACGATCGCCMGACAGGCTGGAAGCTCACGACGGACAGCGGCAGCATCGACGGCATCGGGCCGGGCGACGAGTTGGT 

CCTCAAAGGCGGCGACGGCAATATCGTGATCAGCAATCAGATCTTGAGCAACGACGTGAGCATCGATCTGGCCGATGAGATCGAGGTG 

AACAGGGT G^JJGG CG A G^G/ a' CCCGACACGGGTGCAT CCACGGT GCTGGACGRGA^CGG CCTGAGCTTCACGACGCAGGACG CGAA CG 

GGGAGGACACGGCGC'i'CGGCCCGCGCGTGACGGCGGCGGGCATCCAAGCGGCCGGCAAAATCACCAATGTCGCTGCGGGCGAGGCCGA 

CACCGACGCGGTGAACTTTTCCCAGCTCAGGCAAGTCGAGACCGCATCGGGCAATACCGACCAGCGGGCGGTCAAATATGACTGGACC 

GACGCCAATACGAATGGCGTGATCGATGAGGGCGAACTCAACCTCGATAGCGTGACCCTTGCCGGGGGCATGGGCGGCACCAGGATCT 

CCAATCTTGCGCCCGGCGCCTTGAGCGCGGCATCGACCGATGCCGTCAACGGCAGCCAGCTCTTCGGGCTTCGCAGCCGCGTATCCAA 

CGTGGCCGTCGCCCTGGGCGGGGGAGCCGCCTATGACCCTGTCAAGGATGAGTGGATCGCCCCGAAATACACGATCGGCGGCACCGAC 

TACAGCAATGTCGGCGACGCGCTGGCGGCGGTGGGCGGCACGGCCGGTGCCGGCTGGAGCCTCTCGGCGCAGGGTGCGAACGCGTCCA 

ATGTCGCGCCGGGCGAGACGGTGGATCTTCGCAGCGGCGACGGCAATATCGTCGTCAGCAAGGCGGAGACCGGCGACACCGTGAGCTT 

CGACCTGGCCGACGATCTGGACGTCTCCGZYAAGCATCACGGTGGGTGCCGATCCCGCCGATCCGAATGCGCCGACCACTGTTATCACC 

GGCGGTTCGATCGXGATCGGCAGCACCATGCTCGGCAGCAACGGCCTGGTCATCACCGGGGGGCCGAGCGTCACGACCGATGGCATCG 

ATGCCGGCGGCATGAAGGTCACGAATGTGGCAAACGGTACGGTGGCGAAGGATTCGAAGGATGCCGTCAACGGCGGCCAGCTCTTCGA 

CGTCGTTGCGAATGCCACTGCGAATGGCGTCGGCTATGACGACAAAAGCAAGGGCACCCTGACGCTGGAGGGGGCTAACGGCACCAAG 

ATCACC2\ATGTCGCCGCGGGCGACCTGAACGCGAACTCGACCGACGCGGTCAACGGCAGCCAGCTTTACGCCACGAACGTGAAGGTCG 

ATCGGCTCGATACCGAAGTGAAAGAGATCGACAGCCGCGTAACCTATATCGAGAGCTTCCAGGGCGATCTGGAGAACGCTGCCGTCTA 

TGATACCGACGCTGCCGGCAAGAGGCTCAACACGCTGACGCTCGAGGGAGGAGATCCCGACAAGCCGGTGCTCATCGCCAATGTGGCC 

AAGGGCGTGAAGGCGACCGACGCCGTCAATGTCGGCCAGCTCGACGAAAGCGTCGCGGAAAGCAAGAGCTACACGGACGAAAAGACCG 

AGTGGGCGATCGATCAGGCGGCCATCTACACCGACCAGGTTATCGAGACCAAGGTGAGCGCGGTGAACAATTATGCGCAACAACGGTT 

CGCGCAGCTCTCGGGCGAGATCGGGCAGGTTCGGAGCGAAGCGCGGCAAGCCGCCGCCATCGGACTTGCGGCGGCCTCGCTGCGCTTC 

GACAATGAGCCGGGCAAGCTGAGCGTGGCGCTCGGCGGCGGCTTCTGGAGAAGCGAAGGGGCGCTCGCCTTCGGTGCCGGCTACACCA 

GCGAAGACGGACGCGTCCGGGCGAACCTGACCGGTGCTGCGGCCGGGGGGAACGTCGGTGTCGGTGCCGGCCTCAGCATCACGCTCAA 

CTGA 

SEQIB NO: 36 (BradorMzobiMinm japoBkunmm) 

GTGCGGCUC.iCCGC::^ 

CGGTTGTCGGCTCCGGCAACGGCGTCACCGGCGACAACACCGCGGCCTTCGGCTCCAGTATCGGCATCGCCGGCGGCAACAACGCGGC 

TGTCGGCTCCTTCAGCACCGTCACCGGCAGCAACAGCGCGGCTGTTGGCTCCTTCAACAACGTCAGCGGCAATAACAGCGGCGCCTTT 

GGCACCGGACAGAACATCCGCGGTAACGGCACGTTCGCCATCGGCGATCCCAACATCGTCAACGGCAACAACAGTTTGGTGTTCGGCG 

ACAACAACACGGTGAACGGGTCCAATGTGGCGGGTCGCGGCGACAACATCCAGCTGGTCGGCTCGAACAACACCATTGCGGCAACGTC 

CAGCGCCGCCGGCTCATCCGTGTTCGGCAGCGGCAACACCGTCAACGCCACCAATGCCGTGGTGATGGGCAACAACAGCACCGTCTCC 

GGCGCGTCCTCTGTCGCGATCGGCAATGGCACGGCCGTTACCGGCATCAACGCGATCGCGATGGGCACCGGCGCCGGCGCGAATTTCG 

ACAACTCGGTCGCGATCGGCAGCGGCGCGACCACGACGCGCGCCAACCAGGTCGCAGTCGGCACCGCCAGCAGCACCTACACGATGAG 

CGGCATCACCTCGGCTGCGAGCAAGGCGGCGCAGTCCGGACCGACCCAGCTCGTCACATCGGACGCGGCCGGCAATCTGGCGACCACC 

TCGCTGGCTGGCCTCGGACTTGCCTCCGCCGGCGACATCAACGGAATCAACTCCCAACTGGCCGCCCTCAACGGCCGCGTCGACAACC 

TGACACGGGAGTCGCGCGGCGGCGTGGCGTTGGCGCTCGCGGCGTCCAGCCTGCAATTCGATCCTCGCCCCGGCAAGATCTCGGTCTC 

CGGCGGCTTCGGCAATTTCCAGGGACAATCCGGCCTCGCGGTCGGCCTCGGCTATTCCTATTCGGACGCCATGCGCTTCAACGCGGCG 

TTCACGGCCGCACAGCAAGGCGCCATCGGCGTCAGGGCCGGCGCGTCGTGGACGCTGAACTGA 



-43- 



1IW 



\ - . . \ k . : l > 



Coils uucHU't for 961 hi 
1 1 



0.8 



e.6 



0.4 



e. a 



7 



0 



uJl" 





t 



r' 



r 



r 



v 



W 



window- 14 
uindou=21 
window-28 




0 



50 



100 



150 



200 



850 



Coils output for 961 haeso 



T 



uindou=21 

iji ndouf-S.B 



0.8 



0.6 



0.4 



0.2 



0 



H 




i 




0 



58 



lee 



150 



800 



380 



358 



480 



450 



580 



2110 



I. w V .vVi-V LAY. : 



0.8 



0.6 



0.4 



0.2 



0 



0 



an 



50 



Coils output for 961 kl 
j 



T 




100 



150 



£00 



windou=£l 

ulncio'-.'-aS 



£50 



360 



350 



FIGURE 4 

Coils output for unknown 
1 



0.8 



8.6 



0.4 



0.8 



T 



n 




n 




wi ndo»J=£l 
window«£8 



nr 



L 



iee 



808 



300 



408 



600 



788 



881 



3110 



Coils output for 1 unknown 
1 1 



l 1 — 

uindou=2l 
uindow=28 



0-8 



0.6 



0.4 



0.2 



0 



J 



r 



0 



200 



J. 



480 608 



800 



1080 1200 1408 1600 1880 



FIGURE 6 



Coils output for unknown 
1 1 1 — 



window 14 
uindou»21 
uiindow-28 



0.8 



0.6 



0.4 



0.2 



0 



0 



XL 



286 



400 



n 




m H 



688 



880 



1000 



r 



1 



LI 



—i 

1280 



1408 



1680 



4110 



EFB<SURE 7 



Coils output for unknown 
1 



■j i ttdo«oi!4 
w i ndow=2 1 
windou=28 



8.8 



8.6 



8.4 



8.2 



8 



T TT 



1 fi II 



IL 



a 



JL 



JL 



1 



! 



y 



8 



288 488 688 888 1888 1288 1488 1688 1888 



FIGURE 8 



Coils output for unknown 



w$ nclou™ i 4 
uindow=21 
windows 28 



8.8 



8.6 



8.4 



n 



8. a 



8 



r 



i 



if 



8 



188 



see 



388 



488 



see 



688 



5110 



FIGURE © 



Coils output for unknown 




FIGURE 10 



Coils output for unknown 




0.8 



6110 



FIGURE 11 



Coils output for unknown 



window- 14 
wi ndou=2 1 
window=2S 



T 



0.6 



0.4 



0.2 - 



0 



£3L 



W 



0 



50 



100 



150 



800 



250 



300 



FIGURE 12 



Coils output for unknown 



8.8 



8.6 



0.4 



0.8 



0 



n 



LT 





I 



f 



u u 



windcv^H 
wi ndou=2l 
windou^SQ 



0 



800 



480 



600 



800 



1000 1800 1400 



7110 



Coils output for unknown 
1 1 



windaw=14 
window^Sl 
uindow-28 



e.e 



0.6 



8.4 



0.2 



0 



1 



e 



see 



400 



600 



800 



1000 



1200 



1400 



FIGURE 14 



Coils output for unknown 
— , 1 T- 



Wi ndow BS 14 
window~21 
wi ndow=28 



8.8 



8.6 



0.4 



8.2 



0 



0 



50 100 190 200 250 300 390 400 




8110 

FIGURE 15 




see ieee isea aeee 2see 3ee© 3soa 



FIGURE 16 



EGKl 
HAESO 



ski 



AGgG 

AGgG 
LHG§G 

gHg 



HHR 

■v. 



BUREO . ; 

HAD A 
. ACTAC . . 
:':EXBC/ "i 

Eipp:' . 

EIBP.: 

EIBA 

Etas 

USPA2 
YAljA 
DSRA 
BRUME 
BRTJjSU 
; BRUHE12 

BRUSJJ2 

* ' *■ *■ * • 

BRftjTA::;;V : GE 



R 
R 



m 

SI 



. ppkOTPp^ 

kipYNV§|^^ 



sV^F^GTHO^I^ 

^^FM\GTHTYI 
S.S^^SgGKWVVKG 

5A^R$Sfl^L] 
SM^y's^GRTWWkpQ 



lis 



ifiHpYsy 

PYSV 

p¥sy 
feysyl 
py.svI 
pybyj 

YDNR1 

OTRR! 
&DDR1 





iSMi- V S^fSRWVY KpQGiTN^QG E Ygjfl|£ 



.Gg^VGTSgGSS 

" "Itog.gs- 1 - 

GD— 

bo— 

GD — 

GD — 

GD — 

NTSGNKK 
|5 — 

NTYNGG- 
S-VAYNEAG 
-^VAYNEAG 

S-AAT^GGNWG 
rfcAftAGGNVG 






9110 



FIGURE 17 




FIGURE 20 




8000 



2380 2880 3000 



K1 



K12 




0006 



NADA-K1 



0004 



HYPO LIP 




r 




10110 
FIGURE 18 




FIGURE 19 




PCT/IB2004/002351 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 



lid BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 
Li LiiNJcS Ok MAKKS ON ORIGINAL DOCUMENT 



U REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 
□ OTHER: 



IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 




FADED TEXT OR DRAWING 




